Overview

Dataset statistics

Number of variables57
Number of observations372915
Missing cells10770250
Missing cells (%)50.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory162.2 MiB
Average record size in memory456.0 B

Variable types

Numeric13
Categorical44

Alerts

fiModelDesc has a high cardinality: 4639 distinct valuesHigh cardinality
fiBaseModel has a high cardinality: 1797 distinct valuesHigh cardinality
fiSecondaryDesc has a high cardinality: 163 distinct valuesHigh cardinality
fiModelSeries has a high cardinality: 117 distinct valuesHigh cardinality
fiModelDescriptor has a high cardinality: 134 distinct valuesHigh cardinality
fiProductClassDesc has a high cardinality: 72 distinct valuesHigh cardinality
state has a high cardinality: 53 distinct valuesHigh cardinality
Forks is highly imbalanced (61.3%)Imbalance
Pad_Type is highly imbalanced (69.7%)Imbalance
Transmission is highly imbalanced (61.7%)Imbalance
Turbocharged is highly imbalanced (71.0%)Imbalance
Blade_Extension is highly imbalanced (84.1%)Imbalance
Enclosure_Type is highly imbalanced (54.9%)Imbalance
Engine_Horsepower is highly imbalanced (68.9%)Imbalance
Coupler is highly imbalanced (56.1%)Imbalance
Coupler_System is highly imbalanced (61.6%)Imbalance
Grouser_Tracks is highly imbalanced (63.4%)Imbalance
Hydraulics_Flow is highly imbalanced (92.9%)Imbalance
Undercarriage_Pad_Width is highly imbalanced (68.5%)Imbalance
Stick_Length is highly imbalanced (69.5%)Imbalance
Pattern_Changer is highly imbalanced (69.8%)Imbalance
Grouser_Type is highly imbalanced (60.9%)Imbalance
Backhoe_Mounting is highly imbalanced (99.6%)Imbalance
Travel_Controls is highly imbalanced (72.6%)Imbalance
Differential_Type is highly imbalanced (92.0%)Imbalance
Steering_Controls is highly imbalanced (95.9%)Imbalance
auctioneerID has 17459 (4.7%) missing valuesMissing
MachineHoursCurrentMeter has 242638 (65.1%) missing valuesMissing
UsageBand has 304898 (81.8%) missing valuesMissing
fiSecondaryDesc has 122856 (32.9%) missing valuesMissing
fiModelSeries has 323889 (86.9%) missing valuesMissing
fiModelDescriptor has 302791 (81.2%) missing valuesMissing
ProductSize has 197516 (53.0%) missing valuesMissing
Drive_System has 273743 (73.4%) missing valuesMissing
Forks has 191074 (51.2%) missing valuesMissing
Pad_Type has 296794 (79.6%) missing valuesMissing
Ride_Control has 231900 (62.2%) missing valuesMissing
Stick has 296794 (79.6%) missing valuesMissing
Transmission has 199883 (53.6%) missing valuesMissing
Turbocharged has 296794 (79.6%) missing valuesMissing
Blade_Extension has 349872 (93.8%) missing valuesMissing
Blade_Width has 349872 (93.8%) missing valuesMissing
Enclosure_Type has 349872 (93.8%) missing valuesMissing
Engine_Horsepower has 349872 (93.8%) missing valuesMissing
Hydraulics has 77373 (20.7%) missing valuesMissing
Pushblock has 349872 (93.8%) missing valuesMissing
Ripper has 275994 (74.0%) missing valuesMissing
Scarifier has 349861 (93.8%) missing valuesMissing
Tip_Control has 349872 (93.8%) missing valuesMissing
Tire_Size has 284993 (76.4%) missing valuesMissing
Coupler has 176136 (47.2%) missing valuesMissing
Coupler_System has 332096 (89.1%) missing valuesMissing
Grouser_Tracks has 332180 (89.1%) missing valuesMissing
Hydraulics_Flow has 332180 (89.1%) missing valuesMissing
Track_Type has 283243 (76.0%) missing valuesMissing
Undercarriage_Pad_Width has 282723 (75.8%) missing valuesMissing
Stick_Length has 283187 (75.9%) missing valuesMissing
Thumb has 283129 (75.9%) missing valuesMissing
Pattern_Changer has 283187 (75.9%) missing valuesMissing
Grouser_Type has 283243 (76.0%) missing valuesMissing
Backhoe_Mounting has 299233 (80.2%) missing valuesMissing
Blade_Type has 298372 (80.0%) missing valuesMissing
Travel_Controls has 298367 (80.0%) missing valuesMissing
Differential_Type has 308097 (82.6%) missing valuesMissing
Steering_Controls has 308133 (82.6%) missing valuesMissing
SalesID has unique valuesUnique
MachineHoursCurrentMeter has 62260 (16.7%) zerosZeros
saleDayOfWeek has 22103 (5.9%) zerosZeros

Reproduction

Analysis started2023-06-15 16:24:01.931413
Analysis finished2023-06-15 16:25:36.004141
Duration1 minute and 34.07 seconds
Software versionpandas-profiling v0.0.dev0
Download configurationconfig.json

Variables

SalesID
Real number (ℝ)

Distinct372915
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2010255.6
Minimum1139246
Maximum6333349
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 MiB
2023-06-15T17:25:36.109250image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum1139246
5-th percentile1217768.1
Q11415891.5
median1630519
Q32259277.5
95-th percentile4359822.3
Maximum6333349
Range5194103
Interquartile range (IQR)843386

Descriptive statistics

Standard deviation1094494.7
Coefficient of variation (CV)0.54445548
Kurtosis7.0493974
Mean2010255.6
Median Absolute Deviation (MAD)261434
Skewness2.6719756
Sum7.4965448 × 1011
Variance1.1979186 × 1012
MonotonicityNot monotonic
2023-06-15T17:25:36.246603image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1646770 1
 
< 0.1%
1181577 1
 
< 0.1%
2404870 1
 
< 0.1%
1181578 1
 
< 0.1%
1181801 1
 
< 0.1%
1181839 1
 
< 0.1%
1181840 1
 
< 0.1%
2196192 1
 
< 0.1%
2438318 1
 
< 0.1%
2335112 1
 
< 0.1%
Other values (372905) 372905
> 99.9%
ValueCountFrequency (%)
1139246 1
< 0.1%
1139248 1
< 0.1%
1139249 1
< 0.1%
1139251 1
< 0.1%
1139253 1
< 0.1%
1139255 1
< 0.1%
1139256 1
< 0.1%
1139261 1
< 0.1%
1139272 1
< 0.1%
1139278 1
< 0.1%
ValueCountFrequency (%)
6333349 1
< 0.1%
6333348 1
< 0.1%
6333347 1
< 0.1%
6333345 1
< 0.1%
6333344 1
< 0.1%
6333343 1
< 0.1%
6333342 1
< 0.1%
6333341 1
< 0.1%
6333339 1
< 0.1%
6333338 1
< 0.1%

SalePrice
Real number (ℝ)

Distinct933
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32330.512
Minimum4750
Maximum142000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 MiB
2023-06-15T17:25:36.360779image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum4750
5-th percentile8500
Q115000
median25000
Q342000
95-th percentile83000
Maximum142000
Range137250
Interquartile range (IQR)27000

Descriptive statistics

Standard deviation23528.219
Coefficient of variation (CV)0.72774038
Kurtosis1.9123597
Mean32330.512
Median Absolute Deviation (MAD)12000
Skewness1.4547385
Sum1.2056533 × 1010
Variance5.5357708 × 108
MonotonicityNot monotonic
2023-06-15T17:25:36.471584image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
25000 7242
 
1.9%
20000 6913
 
1.9%
26000 6696
 
1.8%
15000 6573
 
1.8%
14000 6277
 
1.7%
16000 6275
 
1.7%
17000 6178
 
1.7%
18000 6118
 
1.6%
19000 6110
 
1.6%
24000 6110
 
1.6%
Other values (923) 308423
82.7%
ValueCountFrequency (%)
4750 153
 
< 0.1%
4800 16
 
< 0.1%
4850 4
 
< 0.1%
4900 17
 
< 0.1%
4935 1
 
< 0.1%
4950 2
 
< 0.1%
4987 2
 
< 0.1%
5000 703
0.2%
5100 35
 
< 0.1%
5150 1
 
< 0.1%
ValueCountFrequency (%)
142000 5
 
< 0.1%
141000 13
 
< 0.1%
140000 103
< 0.1%
139000 6
 
< 0.1%
138900 1
 
< 0.1%
138000 11
 
< 0.1%
137500 53
< 0.1%
137000 14
 
< 0.1%
136000 18
 
< 0.1%
135000 94
< 0.1%

MachineID
Real number (ℝ)

Distinct318066
Distinct (%)85.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1222473.6
Minimum0
Maximum2486276
Zeros2
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size2.8 MiB
2023-06-15T17:25:36.592609image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile201470.9
Q11084650.5
median1281261
Q31475833
95-th percentile1873247
Maximum2486276
Range2486276
Interquartile range (IQR)391182.5

Descriptive statistics

Standard deviation463304.2
Coefficient of variation (CV)0.37898912
Kurtosis0.78047048
Mean1222473.6
Median Absolute Deviation (MAD)195583
Skewness-0.59929103
Sum4.5587874 × 1011
Variance2.1465078 × 1011
MonotonicityNot monotonic
2023-06-15T17:25:36.711860image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2283592 46
 
< 0.1%
2285830 44
 
< 0.1%
1896854 39
 
< 0.1%
1746392 34
 
< 0.1%
2268800 31
 
< 0.1%
1942724 29
 
< 0.1%
2282547 29
 
< 0.1%
2208545 27
 
< 0.1%
2300370 27
 
< 0.1%
2293171 26
 
< 0.1%
Other values (318056) 372583
99.9%
ValueCountFrequency (%)
0 2
< 0.1%
2 1
< 0.1%
13 1
< 0.1%
17 1
< 0.1%
52 1
< 0.1%
63 1
< 0.1%
66 1
< 0.1%
102 2
< 0.1%
113 1
< 0.1%
116 1
< 0.1%
ValueCountFrequency (%)
2486276 1
< 0.1%
2486275 1
< 0.1%
2486274 1
< 0.1%
2486273 1
< 0.1%
2486111 1
< 0.1%
2486110 1
< 0.1%
2485633 1
< 0.1%
2484864 1
< 0.1%
2484863 1
< 0.1%
2484862 1
< 0.1%

ModelID
Real number (ℝ)

Distinct4833
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6838.2878
Minimum28
Maximum37198
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 MiB
2023-06-15T17:25:36.834173image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum28
5-th percentile802.5
Q13263
median4604
Q38343
95-th percentile22074
Maximum37198
Range37170
Interquartile range (IQR)5080

Descriptive statistics

Standard deviation6202.3953
Coefficient of variation (CV)0.90700997
Kurtosis3.1854999
Mean6838.2878
Median Absolute Deviation (MAD)2409
Skewness1.7880698
Sum2.5501001 × 109
Variance38469707
MonotonicityNot monotonic
2023-06-15T17:25:36.950840image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4605 5306
 
1.4%
3538 4845
 
1.3%
4604 4171
 
1.1%
3170 3950
 
1.1%
3362 3933
 
1.1%
3537 3642
 
1.0%
4603 3296
 
0.9%
3171 3248
 
0.9%
3357 3047
 
0.8%
3178 2954
 
0.8%
Other values (4823) 334523
89.7%
ValueCountFrequency (%)
28 38
 
< 0.1%
29 17
 
< 0.1%
31 12
 
< 0.1%
34 9
 
< 0.1%
43 683
0.2%
47 236
 
0.1%
50 10
 
< 0.1%
53 56
 
< 0.1%
55 3
 
< 0.1%
75 430
0.1%
ValueCountFrequency (%)
37198 2
 
< 0.1%
37197 16
< 0.1%
37196 14
< 0.1%
36933 2
 
< 0.1%
36928 1
 
< 0.1%
36914 1
 
< 0.1%
36894 1
 
< 0.1%
36885 1
 
< 0.1%
36883 1
 
< 0.1%
36880 1
 
< 0.1%

datasource
Real number (ℝ)

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean135.37663
Minimum121
Maximum173
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 MiB
2023-06-15T17:25:37.045842image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum121
5-th percentile121
Q1132
median132
Q3136
95-th percentile149
Maximum173
Range52
Interquartile range (IQR)4

Descriptive statistics

Standard deviation9.8572695
Coefficient of variation (CV)0.072813672
Kurtosis6.517749
Mean135.37663
Median Absolute Deviation (MAD)0
Skewness2.4330397
Sum50483976
Variance97.165762
MonotonicityNot monotonic
2023-06-15T17:25:37.123109image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
132 239231
64.2%
136 64778
 
17.4%
149 30856
 
8.3%
121 20519
 
5.5%
172 17530
 
4.7%
173 1
 
< 0.1%
ValueCountFrequency (%)
121 20519
 
5.5%
132 239231
64.2%
136 64778
 
17.4%
149 30856
 
8.3%
172 17530
 
4.7%
173 1
 
< 0.1%
ValueCountFrequency (%)
173 1
 
< 0.1%
172 17530
 
4.7%
149 30856
 
8.3%
136 64778
 
17.4%
132 239231
64.2%
121 20519
 
5.5%

auctioneerID
Real number (ℝ)

Distinct30
Distinct (%)< 0.1%
Missing17459
Missing (%)4.7%
Infinite0
Infinite (%)0.0%
Mean6.4869013
Minimum0
Maximum99
Zeros431
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size2.8 MiB
2023-06-15T17:25:37.215285image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q34
95-th percentile21
Maximum99
Range99
Interquartile range (IQR)3

Descriptive statistics

Standard deviation17.116211
Coefficient of variation (CV)2.6385804
Kurtosis23.071733
Mean6.4869013
Median Absolute Deviation (MAD)0
Skewness4.8327465
Sum2305808
Variance292.96468
MonotonicityNot monotonic
2023-06-15T17:25:37.310245image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
1 180673
48.4%
2 50296
 
13.5%
3 25541
 
6.8%
4 18290
 
4.9%
6 11570
 
3.1%
99 10841
 
2.9%
7 7004
 
1.9%
8 6424
 
1.7%
5 6300
 
1.7%
10 4958
 
1.3%
Other values (20) 33559
 
9.0%
(Missing) 17459
 
4.7%
ValueCountFrequency (%)
0 431
 
0.1%
1 180673
48.4%
2 50296
 
13.5%
3 25541
 
6.8%
4 18290
 
4.9%
5 6300
 
1.7%
6 11570
 
3.1%
7 7004
 
1.9%
8 6424
 
1.7%
9 3268
 
0.9%
ValueCountFrequency (%)
99 10841
2.9%
28 755
 
0.2%
27 1052
 
0.3%
26 684
 
0.2%
25 861
 
0.2%
24 1337
 
0.4%
23 1162
 
0.3%
22 1042
 
0.3%
21 1280
 
0.3%
20 1965
 
0.5%

YearMade
Real number (ℝ)

Distinct70
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1993.9412
Minimum1937
Maximum2014
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 MiB
2023-06-15T17:25:37.418364image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum1937
5-th percentile1975
Q11988
median1996
Q32001
95-th percentile2006
Maximum2014
Range77
Interquartile range (IQR)13

Descriptive statistics

Standard deviation9.579253
Coefficient of variation (CV)0.0048041802
Kurtosis0.20326391
Mean1993.9412
Median Absolute Deviation (MAD)6
Skewness-0.87515654
Sum7.435706 × 108
Variance91.762089
MonotonicityNot monotonic
2023-06-15T17:25:37.533606image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2005 22048
 
5.9%
1998 21743
 
5.8%
2004 20874
 
5.6%
1999 19263
 
5.2%
1997 19259
 
5.2%
2000 17230
 
4.6%
1996 16993
 
4.6%
1995 15798
 
4.2%
2003 14649
 
3.9%
1994 14456
 
3.9%
Other values (60) 190602
51.1%
ValueCountFrequency (%)
1937 1
 
< 0.1%
1942 1
 
< 0.1%
1947 1
 
< 0.1%
1948 3
 
< 0.1%
1949 1
 
< 0.1%
1950 8
< 0.1%
1951 7
< 0.1%
1952 6
< 0.1%
1953 7
< 0.1%
1954 3
 
< 0.1%
ValueCountFrequency (%)
2014 2
 
< 0.1%
2013 1
 
< 0.1%
2012 1
 
< 0.1%
2011 31
 
< 0.1%
2010 58
 
< 0.1%
2009 212
 
0.1%
2008 1691
 
0.5%
2007 5044
 
1.4%
2006 13409
3.6%
2005 22048
5.9%

MachineHoursCurrentMeter
Real number (ℝ)

MISSING  ZEROS 

Distinct14316
Distinct (%)11.0%
Missing242638
Missing (%)65.1%
Infinite0
Infinite (%)0.0%
Mean2491.069
Minimum0
Maximum185100
Zeros62260
Zeros (%)16.7%
Negative0
Negative (%)0.0%
Memory size2.8 MiB
2023-06-15T17:25:37.655240image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median380
Q33238
95-th percentile10139.2
Maximum185100
Range185100
Interquartile range (IQR)3238

Descriptive statistics

Standard deviation5745.5617
Coefficient of variation (CV)2.3064643
Kurtosis308.45129
Mean2491.069
Median Absolute Deviation (MAD)380
Skewness13.224764
Sum3.24529 × 108
Variance33011479
MonotonicityNot monotonic
2023-06-15T17:25:37.765335image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 62260
 
16.7%
2000 114
 
< 0.1%
1000 110
 
< 0.1%
1500 96
 
< 0.1%
500 95
 
< 0.1%
24 90
 
< 0.1%
1200 85
 
< 0.1%
800 84
 
< 0.1%
1700 80
 
< 0.1%
1400 79
 
< 0.1%
Other values (14306) 67184
 
18.0%
(Missing) 242638
65.1%
ValueCountFrequency (%)
0 62260
16.7%
1 2
 
< 0.1%
2 18
 
< 0.1%
3 21
 
< 0.1%
4 35
 
< 0.1%
5 43
 
< 0.1%
6 19
 
< 0.1%
7 12
 
< 0.1%
8 16
 
< 0.1%
9 11
 
< 0.1%
ValueCountFrequency (%)
185100 1
< 0.1%
183500 1
< 0.1%
180300 1
< 0.1%
180000 1
< 0.1%
177800 1
< 0.1%
177200 1
< 0.1%
176700 1
< 0.1%
176300 1
< 0.1%
176100 1
< 0.1%
174900 1
< 0.1%

UsageBand
Categorical

Distinct3
Distinct (%)< 0.1%
Missing304898
Missing (%)81.8%
Memory size2.8 MiB
Medium
34384 
Low
21717 
High
11916 

Length

Max length6
Median length6
Mean length4.6917535
Min length3

Characters and Unicode

Total characters319119
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowLow
2nd rowLow
3rd rowLow
4th rowMedium
5th rowMedium

Common Values

ValueCountFrequency (%)
Medium 34384
 
9.2%
Low 21717
 
5.8%
High 11916
 
3.2%
(Missing) 304898
81.8%

Length

2023-06-15T17:25:37.872080image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:37.973266image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
medium 34384
50.6%
low 21717
31.9%
high 11916
 
17.5%

Most occurring characters

ValueCountFrequency (%)
i 46300
14.5%
M 34384
10.8%
e 34384
10.8%
d 34384
10.8%
u 34384
10.8%
m 34384
10.8%
L 21717
6.8%
o 21717
6.8%
w 21717
6.8%
H 11916
 
3.7%
Other values (2) 23832
7.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 251102
78.7%
Uppercase Letter 68017
 
21.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 46300
18.4%
e 34384
13.7%
d 34384
13.7%
u 34384
13.7%
m 34384
13.7%
o 21717
8.6%
w 21717
8.6%
g 11916
 
4.7%
h 11916
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
M 34384
50.6%
L 21717
31.9%
H 11916
 
17.5%

Most occurring scripts

ValueCountFrequency (%)
Latin 319119
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 46300
14.5%
M 34384
10.8%
e 34384
10.8%
d 34384
10.8%
u 34384
10.8%
m 34384
10.8%
L 21717
6.8%
o 21717
6.8%
w 21717
6.8%
H 11916
 
3.7%
Other values (2) 23832
7.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 319119
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 46300
14.5%
M 34384
10.8%
e 34384
10.8%
d 34384
10.8%
u 34384
10.8%
m 34384
10.8%
L 21717
6.8%
o 21717
6.8%
w 21717
6.8%
H 11916
 
3.7%
Other values (2) 23832
7.5%

fiModelDesc
Categorical

Distinct4639
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size2.8 MiB
310G
 
5306
416C
 
4845
310E
 
4171
580K
 
3950
140G
 
3933
Other values (4634)
350710 

Length

Max length19
Median length17
Mean length4.667919
Min length1

Characters and Unicode

Total characters1740737
Distinct characters57
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique802 ?
Unique (%)0.2%

Sample

1st rowTD20
2nd rowA66
3rd rowD7G
4th rowA62
5th rowD3B

Common Values

ValueCountFrequency (%)
310G 5306
 
1.4%
416C 4845
 
1.3%
310E 4171
 
1.1%
580K 3950
 
1.1%
140G 3933
 
1.1%
416B 3657
 
1.0%
310D 3296
 
0.9%
580L 3248
 
0.9%
12G 3047
 
0.8%
580SUPER L 2954
 
0.8%
Other values (4629) 334508
89.7%

Length

2023-06-15T17:25:38.067089image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
580super 5435
 
1.4%
310g 5306
 
1.4%
416c 4845
 
1.3%
310e 4171
 
1.1%
580k 3950
 
1.0%
140g 3933
 
1.0%
416b 3657
 
1.0%
l 3345
 
0.9%
310d 3296
 
0.9%
580l 3249
 
0.9%
Other values (4629) 338535
89.2%

Most occurring characters

ValueCountFrequency (%)
0 206863
 
11.9%
5 133270
 
7.7%
1 123334
 
7.1%
3 111962
 
6.4%
2 110918
 
6.4%
4 94278
 
5.4%
6 88555
 
5.1%
L 88355
 
5.1%
C 87507
 
5.0%
D 79400
 
4.6%
Other values (47) 616295
35.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1025981
58.9%
Uppercase Letter 687099
39.5%
Dash Punctuation 18504
 
1.1%
Space Separator 7148
 
0.4%
Other Punctuation 1909
 
0.1%
Lowercase Letter 63
 
< 0.1%
Math Symbol 25
 
< 0.1%
Open Punctuation 4
 
< 0.1%
Close Punctuation 4
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
L 88355
12.9%
C 87507
12.7%
D 79400
11.6%
G 54550
 
7.9%
I 46707
 
6.8%
E 44942
 
6.5%
B 42959
 
6.3%
P 39583
 
5.8%
S 27330
 
4.0%
H 26952
 
3.9%
Other values (16) 148814
21.7%
Lowercase Letter
ValueCountFrequency (%)
t 10
15.9%
e 8
12.7%
i 7
11.1%
o 6
9.5%
a 6
9.5%
g 4
 
6.3%
h 4
 
6.3%
m 4
 
6.3%
s 3
 
4.8%
r 3
 
4.8%
Other values (4) 8
12.7%
Decimal Number
ValueCountFrequency (%)
0 206863
20.2%
5 133270
13.0%
1 123334
12.0%
3 111962
10.9%
2 110918
10.8%
4 94278
9.2%
6 88555
8.6%
8 66204
 
6.5%
9 45917
 
4.5%
7 44680
 
4.4%
Other Punctuation
ValueCountFrequency (%)
. 1907
99.9%
/ 2
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 18504
100.0%
Space Separator
ValueCountFrequency (%)
7148
100.0%
Math Symbol
ValueCountFrequency (%)
+ 25
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1053575
60.5%
Latin 687162
39.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
L 88355
12.9%
C 87507
12.7%
D 79400
11.6%
G 54550
 
7.9%
I 46707
 
6.8%
E 44942
 
6.5%
B 42959
 
6.3%
P 39583
 
5.8%
S 27330
 
4.0%
H 26952
 
3.9%
Other values (30) 148877
21.7%
Common
ValueCountFrequency (%)
0 206863
19.6%
5 133270
12.6%
1 123334
11.7%
3 111962
10.6%
2 110918
10.5%
4 94278
8.9%
6 88555
8.4%
8 66204
 
6.3%
9 45917
 
4.4%
7 44680
 
4.2%
Other values (7) 27594
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1740737
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 206863
 
11.9%
5 133270
 
7.7%
1 123334
 
7.1%
3 111962
 
6.4%
2 110918
 
6.4%
4 94278
 
5.4%
6 88555
 
5.1%
L 88355
 
5.1%
C 87507
 
5.0%
D 79400
 
4.6%
Other values (47) 616295
35.4%

fiBaseModel
Categorical

Distinct1797
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size2.8 MiB
580
 
18623
310
 
17417
D6
 
12988
416
 
12585
D5
 
9398
Other values (1792)
301904 

Length

Max length13
Median length3
Mean length3.1776464
Min length1

Characters and Unicode

Total characters1184992
Distinct characters39
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique213 ?
Unique (%)0.1%

Sample

1st rowTD20
2nd rowA66
3rd rowD7
4th rowA62
5th rowD3

Common Values

ValueCountFrequency (%)
580 18623
 
5.0%
310 17417
 
4.7%
D6 12988
 
3.5%
416 12585
 
3.4%
D5 9398
 
2.5%
950 7282
 
2.0%
D3 6803
 
1.8%
D8 6505
 
1.7%
D4 6339
 
1.7%
12 5968
 
1.6%
Other values (1787) 269007
72.1%

Length

2023-06-15T17:25:38.165045image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
580 18623
 
5.0%
310 17417
 
4.7%
d6 12988
 
3.5%
416 12585
 
3.4%
d5 9398
 
2.5%
950 7282
 
2.0%
d3 6803
 
1.8%
d8 6505
 
1.7%
d4 6339
 
1.7%
12 5968
 
1.6%
Other values (1760) 269408
72.2%

Most occurring characters

ValueCountFrequency (%)
0 206296
17.4%
5 128033
10.8%
1 116615
9.8%
3 106726
9.0%
2 104770
8.8%
4 94154
7.9%
6 83069
7.0%
8 65109
 
5.5%
D 60021
 
5.1%
9 45915
 
3.9%
Other values (29) 174284
14.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 992746
83.8%
Uppercase Letter 189875
 
16.0%
Other Punctuation 1876
 
0.2%
Space Separator 401
 
< 0.1%
Dash Punctuation 94
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
D 60021
31.6%
C 17712
 
9.3%
E 13357
 
7.0%
P 13135
 
6.9%
S 12809
 
6.7%
X 12321
 
6.5%
T 11992
 
6.3%
L 9054
 
4.8%
W 7831
 
4.1%
A 6672
 
3.5%
Other values (15) 24971
13.2%
Decimal Number
ValueCountFrequency (%)
0 206296
20.8%
5 128033
12.9%
1 116615
11.7%
3 106726
10.8%
2 104770
10.6%
4 94154
9.5%
6 83069
8.4%
8 65109
 
6.6%
9 45915
 
4.6%
7 42059
 
4.2%
Other Punctuation
ValueCountFrequency (%)
. 1875
99.9%
/ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
401
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 94
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 995117
84.0%
Latin 189875
 
16.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
D 60021
31.6%
C 17712
 
9.3%
E 13357
 
7.0%
P 13135
 
6.9%
S 12809
 
6.7%
X 12321
 
6.5%
T 11992
 
6.3%
L 9054
 
4.8%
W 7831
 
4.1%
A 6672
 
3.5%
Other values (15) 24971
13.2%
Common
ValueCountFrequency (%)
0 206296
20.7%
5 128033
12.9%
1 116615
11.7%
3 106726
10.7%
2 104770
10.5%
4 94154
9.5%
6 83069
8.3%
8 65109
 
6.5%
9 45915
 
4.6%
7 42059
 
4.2%
Other values (4) 2371
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1184992
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 206296
17.4%
5 128033
10.8%
1 116615
9.8%
3 106726
9.0%
2 104770
8.8%
4 94154
7.9%
6 83069
7.0%
8 65109
 
5.5%
D 60021
 
5.1%
9 45915
 
3.9%
Other values (29) 174284
14.7%

fiSecondaryDesc
Categorical

HIGH CARDINALITY  MISSING 

Distinct163
Distinct (%)0.1%
Missing122856
Missing (%)32.9%
Memory size2.8 MiB
C
42005 
B
37260 
G
35920 
H
23606 
E
19345 
Other values (158)
91923 

Length

Max length13
Median length1
Mean length1.2390556
Min length1

Characters and Unicode

Total characters309837
Distinct characters37
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)< 0.1%

Sample

1st rowG
2nd rowB
3rd rowC
4th rowH
5th rowF

Common Values

ValueCountFrequency (%)
C 42005
 
11.3%
B 37260
 
10.0%
G 35920
 
9.6%
H 23606
 
6.3%
E 19345
 
5.2%
D 19004
 
5.1%
F 8860
 
2.4%
K 7434
 
2.0%
M 5192
 
1.4%
L 5169
 
1.4%
Other values (153) 46264
 
12.4%
(Missing) 122856
32.9%

Length

2023-06-15T17:25:38.257601image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
c 42015
16.4%
b 37315
14.6%
g 35929
14.0%
h 23606
9.2%
e 19570
7.6%
d 19004
7.4%
f 8860
 
3.5%
l 8781
 
3.4%
k 8257
 
3.2%
m 6838
 
2.7%
Other values (139) 46223
18.0%

Most occurring characters

ValueCountFrequency (%)
C 46021
14.9%
B 37919
12.2%
G 37540
12.1%
E 30754
9.9%
H 23973
 
7.7%
D 19177
 
6.2%
L 16009
 
5.2%
S 11742
 
3.8%
R 11524
 
3.7%
P 10539
 
3.4%
Other values (27) 64639
20.9%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 302794
97.7%
Space Separator 6681
 
2.2%
Decimal Number 287
 
0.1%
Dash Punctuation 40
 
< 0.1%
Other Punctuation 35
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
C 46021
15.2%
B 37919
12.5%
G 37540
12.4%
E 30754
10.2%
H 23973
7.9%
D 19177
 
6.3%
L 16009
 
5.3%
S 11742
 
3.9%
R 11524
 
3.8%
P 10539
 
3.5%
Other values (14) 57596
19.0%
Decimal Number
ValueCountFrequency (%)
7 124
43.2%
3 113
39.4%
2 22
 
7.7%
5 14
 
4.9%
0 8
 
2.8%
1 4
 
1.4%
6 1
 
0.3%
9 1
 
0.3%
Other Punctuation
ValueCountFrequency (%)
# 14
40.0%
? 14
40.0%
. 7
20.0%
Space Separator
ValueCountFrequency (%)
6681
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 40
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 302794
97.7%
Common 7043
 
2.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
C 46021
15.2%
B 37919
12.5%
G 37540
12.4%
E 30754
10.2%
H 23973
7.9%
D 19177
 
6.3%
L 16009
 
5.3%
S 11742
 
3.9%
R 11524
 
3.8%
P 10539
 
3.5%
Other values (14) 57596
19.0%
Common
ValueCountFrequency (%)
6681
94.9%
7 124
 
1.8%
3 113
 
1.6%
- 40
 
0.6%
2 22
 
0.3%
# 14
 
0.2%
? 14
 
0.2%
5 14
 
0.2%
0 8
 
0.1%
. 7
 
0.1%
Other values (3) 6
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 309837
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
C 46021
14.9%
B 37919
12.2%
G 37540
12.1%
E 30754
9.9%
H 23973
 
7.7%
D 19177
 
6.2%
L 16009
 
5.2%
S 11742
 
3.8%
R 11524
 
3.7%
P 10539
 
3.4%
Other values (27) 64639
20.9%

fiModelSeries
Categorical

HIGH CARDINALITY  MISSING 

Distinct117
Distinct (%)0.2%
Missing323889
Missing (%)86.9%
Memory size2.8 MiB
II
13009 
LC
7731 
III
5099 
-1
3498 
-2
2779 
Other values (112)
16910 

Length

Max length11
Median length2
Mean length2.1981194
Min length1

Characters and Unicode

Total characters107765
Distinct characters46
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)< 0.1%

Sample

1st rowII
2nd rowII
3rd rowII
4th rowII
5th row1

Common Values

ValueCountFrequency (%)
II 13009
 
3.5%
LC 7731
 
2.1%
III 5099
 
1.4%
-1 3498
 
0.9%
-2 2779
 
0.7%
-3 2148
 
0.6%
-6 1998
 
0.5%
-5 1841
 
0.5%
-12 1241
 
0.3%
7 698
 
0.2%
Other values (107) 8984
 
2.4%
(Missing) 323889
86.9%

Length

2023-06-15T17:25:38.371993image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
ii 13009
26.5%
lc 7731
15.8%
iii 5099
 
10.4%
1 3663
 
7.5%
2 3098
 
6.3%
6 2642
 
5.4%
3 2612
 
5.3%
5 2319
 
4.7%
7 1287
 
2.6%
12 1258
 
2.6%
Other values (91) 6308
12.9%

Most occurring characters

ValueCountFrequency (%)
I 41956
38.9%
- 18002
16.7%
L 8880
 
8.2%
C 8667
 
8.0%
1 6674
 
6.2%
2 5691
 
5.3%
3 3565
 
3.3%
5 3249
 
3.0%
6 2914
 
2.7%
7 1571
 
1.5%
Other values (36) 6596
 
6.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 63570
59.0%
Decimal Number 25240
 
23.4%
Dash Punctuation 18002
 
16.7%
Other Punctuation 813
 
0.8%
Math Symbol 105
 
0.1%
Lowercase Letter 35
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
I 41956
66.0%
L 8880
 
14.0%
C 8667
 
13.6%
V 1074
 
1.7%
M 684
 
1.1%
E 574
 
0.9%
N 522
 
0.8%
A 421
 
0.7%
T 241
 
0.4%
S 103
 
0.2%
Other values (13) 448
 
0.7%
Decimal Number
ValueCountFrequency (%)
1 6674
26.4%
2 5691
22.5%
3 3565
14.1%
5 3249
12.9%
6 2914
11.5%
7 1571
 
6.2%
8 800
 
3.2%
0 759
 
3.0%
4 16
 
0.1%
9 1
 
< 0.1%
Lowercase Letter
ValueCountFrequency (%)
t 8
22.9%
e 6
17.1%
o 4
11.4%
m 4
11.4%
a 4
11.4%
i 3
 
8.6%
s 3
 
8.6%
r 3
 
8.6%
Other Punctuation
ValueCountFrequency (%)
# 354
43.5%
? 354
43.5%
. 105
 
12.9%
Dash Punctuation
ValueCountFrequency (%)
- 18002
100.0%
Math Symbol
ValueCountFrequency (%)
+ 105
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 63605
59.0%
Common 44160
41.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
I 41956
66.0%
L 8880
 
14.0%
C 8667
 
13.6%
V 1074
 
1.7%
M 684
 
1.1%
E 574
 
0.9%
N 522
 
0.8%
A 421
 
0.7%
T 241
 
0.4%
S 103
 
0.2%
Other values (21) 483
 
0.8%
Common
ValueCountFrequency (%)
- 18002
40.8%
1 6674
 
15.1%
2 5691
 
12.9%
3 3565
 
8.1%
5 3249
 
7.4%
6 2914
 
6.6%
7 1571
 
3.6%
8 800
 
1.8%
0 759
 
1.7%
# 354
 
0.8%
Other values (5) 581
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 107765
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
I 41956
38.9%
- 18002
16.7%
L 8880
 
8.2%
C 8667
 
8.0%
1 6674
 
6.2%
2 5691
 
5.3%
3 3565
 
3.3%
5 3249
 
3.0%
6 2914
 
2.7%
7 1571
 
1.5%
Other values (36) 6596
 
6.1%

fiModelDescriptor
Categorical

HIGH CARDINALITY  MISSING 

Distinct134
Distinct (%)0.2%
Missing302791
Missing (%)81.2%
Memory size2.8 MiB
L
15809 
LGP
15627 
LC
12310 
XL
6503 
6
2557 
Other values (129)
17318 

Length

Max length14
Median length10
Mean length1.9043551
Min length1

Characters and Unicode

Total characters133541
Distinct characters50
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)< 0.1%

Sample

1st rowLC
2nd rowLGP
3rd rowLGP
4th rowLGP
5th rowLC

Common Values

ValueCountFrequency (%)
L 15809
 
4.2%
LGP 15627
 
4.2%
LC 12310
 
3.3%
XL 6503
 
1.7%
6 2557
 
0.7%
LT 2361
 
0.6%
5 1956
 
0.5%
CR 1772
 
0.5%
3 1454
 
0.4%
H 991
 
0.3%
Other values (124) 8784
 
2.4%
(Missing) 302791
81.2%

Length

2023-06-15T17:25:38.485971image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
l 15809
22.5%
lgp 15636
22.3%
lc 12310
17.6%
xl 6503
9.3%
6 2557
 
3.6%
lt 2361
 
3.4%
5 1956
 
2.8%
cr 1772
 
2.5%
3 1454
 
2.1%
h 991
 
1.4%
Other values (122) 8780
12.5%

Most occurring characters

ValueCountFrequency (%)
L 54412
40.7%
P 15858
 
11.9%
G 15723
 
11.8%
C 15107
 
11.3%
X 7412
 
5.6%
T 4108
 
3.1%
R 3856
 
2.9%
S 2676
 
2.0%
6 2559
 
1.9%
5 1965
 
1.5%
Other values (40) 9865
 
7.4%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 125100
93.7%
Decimal Number 8223
 
6.2%
Other Punctuation 77
 
0.1%
Math Symbol 76
 
0.1%
Space Separator 29
 
< 0.1%
Lowercase Letter 28
 
< 0.1%
Open Punctuation 4
 
< 0.1%
Close Punctuation 4
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
L 54412
43.5%
P 15858
 
12.7%
G 15723
 
12.6%
C 15107
 
12.1%
X 7412
 
5.9%
T 4108
 
3.3%
R 3856
 
3.1%
S 2676
 
2.1%
H 1244
 
1.0%
Z 1015
 
0.8%
Other values (14) 3689
 
2.9%
Lowercase Letter
ValueCountFrequency (%)
h 4
14.3%
g 4
14.3%
i 4
14.3%
t 2
7.1%
f 2
7.1%
n 2
7.1%
c 2
7.1%
a 2
7.1%
e 2
7.1%
o 2
7.1%
Decimal Number
ValueCountFrequency (%)
6 2559
31.1%
5 1965
23.9%
3 1567
19.1%
7 926
 
11.3%
2 439
 
5.3%
0 305
 
3.7%
8 295
 
3.6%
4 116
 
1.4%
1 51
 
0.6%
Other Punctuation
ValueCountFrequency (%)
. 76
98.7%
/ 1
 
1.3%
Math Symbol
ValueCountFrequency (%)
+ 76
100.0%
Space Separator
ValueCountFrequency (%)
29
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 125128
93.7%
Common 8413
 
6.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
L 54412
43.5%
P 15858
 
12.7%
G 15723
 
12.6%
C 15107
 
12.1%
X 7412
 
5.9%
T 4108
 
3.3%
R 3856
 
3.1%
S 2676
 
2.1%
H 1244
 
1.0%
Z 1015
 
0.8%
Other values (25) 3717
 
3.0%
Common
ValueCountFrequency (%)
6 2559
30.4%
5 1965
23.4%
3 1567
18.6%
7 926
 
11.0%
2 439
 
5.2%
0 305
 
3.6%
8 295
 
3.5%
4 116
 
1.4%
. 76
 
0.9%
+ 76
 
0.9%
Other values (5) 89
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 133541
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
L 54412
40.7%
P 15858
 
11.9%
G 15723
 
11.8%
C 15107
 
11.3%
X 7412
 
5.6%
T 4108
 
3.1%
R 3856
 
2.9%
S 2676
 
2.0%
6 2559
 
1.9%
5 1965
 
1.5%
Other values (40) 9865
 
7.4%

ProductSize
Categorical

Distinct6
Distinct (%)< 0.1%
Missing197516
Missing (%)53.0%
Memory size2.8 MiB
Medium
59334 
Large / Medium
46474 
Small
23743 
Mini
20876 
Large
19171 

Length

Max length14
Median length7
Mean length7.6700608
Min length4

Characters and Unicode

Total characters1345321
Distinct characters20
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMedium
2nd rowLarge
3rd rowMedium
4th rowCompact
5th rowMedium

Common Values

ValueCountFrequency (%)
Medium 59334
 
15.9%
Large / Medium 46474
 
12.5%
Small 23743
 
6.4%
Mini 20876
 
5.6%
Large 19171
 
5.1%
Compact 5801
 
1.6%
(Missing) 197516
53.0%

Length

2023-06-15T17:25:38.578536image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:38.683684image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
medium 105808
39.4%
large 65645
24.5%
46474
17.3%
small 23743
 
8.8%
mini 20876
 
7.8%
compact 5801
 
2.2%

Most occurring characters

ValueCountFrequency (%)
e 171453
12.7%
i 147560
11.0%
m 135352
10.1%
M 126684
9.4%
d 105808
7.9%
u 105808
7.9%
a 95189
7.1%
92948
 
6.9%
r 65645
 
4.9%
g 65645
 
4.9%
Other values (10) 233229
17.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 984026
73.1%
Uppercase Letter 221873
 
16.5%
Space Separator 92948
 
6.9%
Other Punctuation 46474
 
3.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 171453
17.4%
i 147560
15.0%
m 135352
13.8%
d 105808
10.8%
u 105808
10.8%
a 95189
9.7%
r 65645
 
6.7%
g 65645
 
6.7%
l 47486
 
4.8%
n 20876
 
2.1%
Other values (4) 23204
 
2.4%
Uppercase Letter
ValueCountFrequency (%)
M 126684
57.1%
L 65645
29.6%
S 23743
 
10.7%
C 5801
 
2.6%
Space Separator
ValueCountFrequency (%)
92948
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 46474
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1205899
89.6%
Common 139422
 
10.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 171453
14.2%
i 147560
12.2%
m 135352
11.2%
M 126684
10.5%
d 105808
8.8%
u 105808
8.8%
a 95189
7.9%
r 65645
 
5.4%
g 65645
 
5.4%
L 65645
 
5.4%
Other values (8) 121110
10.0%
Common
ValueCountFrequency (%)
92948
66.7%
/ 46474
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1345321
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 171453
12.7%
i 147560
11.0%
m 135352
10.1%
M 126684
9.4%
d 105808
7.9%
u 105808
7.9%
a 95189
7.1%
92948
 
6.9%
r 65645
 
4.9%
g 65645
 
4.9%
Other values (10) 233229
17.3%
Distinct72
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.8 MiB
Backhoe Loader - 14.0 to 15.0 Ft Standard Digging Depth
54547 
Track Type Tractor, Dozer - 20.0 to 75.0 Horsepower
 
15082
Wheel Loader - 150.0 to 175.0 Horsepower
 
14587
Track Type Tractor, Dozer - 85.0 to 105.0 Horsepower
 
14412
Hydraulic Excavator, Track - 21.0 to 24.0 Metric Tons
 
12573
Other values (67)
261714 

Length

Max length64
Median length57
Mean length49.799989
Min length27

Characters and Unicode

Total characters18571163
Distinct characters54
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTrack Type Tractor, Dozer - 105.0 to 130.0 Horsepower
2nd rowWheel Loader - 120.0 to 135.0 Horsepower
3rd rowTrack Type Tractor, Dozer - 190.0 to 260.0 Horsepower
4th rowWheel Loader - Unidentified
5th rowTrack Type Tractor, Dozer - 20.0 to 75.0 Horsepower

Common Values

ValueCountFrequency (%)
Backhoe Loader - 14.0 to 15.0 Ft Standard Digging Depth 54547
 
14.6%
Track Type Tractor, Dozer - 20.0 to 75.0 Horsepower 15082
 
4.0%
Wheel Loader - 150.0 to 175.0 Horsepower 14587
 
3.9%
Track Type Tractor, Dozer - 85.0 to 105.0 Horsepower 14412
 
3.9%
Hydraulic Excavator, Track - 21.0 to 24.0 Metric Tons 12573
 
3.4%
Track Type Tractor, Dozer - 130.0 to 160.0 Horsepower 10989
 
2.9%
Track Type Tractor, Dozer - 260.0 + Horsepower 10354
 
2.8%
Backhoe Loader - 15.0 to 16.0 Ft Standard Digging Depth 10170
 
2.7%
Hydraulic Excavator, Track - 12.0 to 14.0 Metric Tons 10092
 
2.7%
Wheel Loader - 120.0 to 135.0 Horsepower 10007
 
2.7%
Other values (62) 210102
56.3%

Length

2023-06-15T17:25:39.156451image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
389300
 
12.1%
to 349814
 
10.8%
loader 183319
 
5.7%
track 166329
 
5.2%
horsepower 164122
 
5.1%
excavator 91214
 
2.8%
hydraulic 91214
 
2.8%
tons 91093
 
2.8%
metric 91093
 
2.8%
backhoe 76398
 
2.4%
Other values (73) 1533381
47.5%

Most occurring characters

ValueCountFrequency (%)
2854362
15.4%
r 1401019
 
7.5%
o 1316927
 
7.1%
e 1188214
 
6.4%
a 1065360
 
5.7%
0 1021476
 
5.5%
t 976191
 
5.3%
. 718608
 
3.9%
c 631522
 
3.4%
1 534902
 
2.9%
Other values (44) 6862582
37.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 10174606
54.8%
Space Separator 2854362
 
15.4%
Decimal Number 2495754
 
13.4%
Uppercase Letter 1769555
 
9.5%
Other Punctuation 884937
 
4.8%
Dash Punctuation 372915
 
2.0%
Math Symbol 18980
 
0.1%
Open Punctuation 27
 
< 0.1%
Close Punctuation 27
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r 1401019
13.8%
o 1316927
12.9%
e 1188214
11.7%
a 1065360
10.5%
t 976191
9.6%
c 631522
 
6.2%
d 493844
 
4.9%
i 462709
 
4.5%
p 392948
 
3.9%
n 286442
 
2.8%
Other values (14) 1959430
19.3%
Uppercase Letter
ValueCountFrequency (%)
T 407652
23.0%
H 255336
14.4%
L 223424
12.6%
D 222063
12.5%
S 155182
 
8.8%
M 114360
 
6.5%
E 91214
 
5.2%
B 76398
 
4.3%
F 73474
 
4.2%
W 66067
 
3.7%
Other values (3) 84385
 
4.8%
Decimal Number
ValueCountFrequency (%)
0 1021476
40.9%
1 534902
21.4%
5 273558
 
11.0%
2 176686
 
7.1%
4 133428
 
5.3%
3 102332
 
4.1%
6 93801
 
3.8%
7 85136
 
3.4%
8 45440
 
1.8%
9 28995
 
1.2%
Other Punctuation
ValueCountFrequency (%)
. 718608
81.2%
, 166329
 
18.8%
Space Separator
ValueCountFrequency (%)
2854362
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 372915
100.0%
Math Symbol
ValueCountFrequency (%)
+ 18980
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 11944161
64.3%
Common 6627002
35.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
r 1401019
 
11.7%
o 1316927
 
11.0%
e 1188214
 
9.9%
a 1065360
 
8.9%
t 976191
 
8.2%
c 631522
 
5.3%
d 493844
 
4.1%
i 462709
 
3.9%
T 407652
 
3.4%
p 392948
 
3.3%
Other values (27) 3607775
30.2%
Common
ValueCountFrequency (%)
2854362
43.1%
0 1021476
 
15.4%
. 718608
 
10.8%
1 534902
 
8.1%
- 372915
 
5.6%
5 273558
 
4.1%
2 176686
 
2.7%
, 166329
 
2.5%
4 133428
 
2.0%
3 102332
 
1.5%
Other values (7) 272406
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 18571163
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2854362
15.4%
r 1401019
 
7.5%
o 1316927
 
7.1%
e 1188214
 
6.4%
a 1065360
 
5.7%
0 1021476
 
5.5%
t 976191
 
5.3%
. 718608
 
3.9%
c 631522
 
3.4%
1 534902
 
2.9%
Other values (44) 6862582
37.0%

state
Categorical

Distinct53
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.8 MiB
Florida
60971 
Texas
49135 
California
27534 
Washington
 
14537
Georgia
 
13799
Other values (48)
206939 

Length

Max length14
Median length12
Mean length8.0555033
Min length4

Characters and Unicode

Total characters3004018
Distinct characters46
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTexas
2nd rowFlorida
3rd rowFlorida
4th rowFlorida
5th rowFlorida

Common Values

ValueCountFrequency (%)
Florida 60971
16.3%
Texas 49135
 
13.2%
California 27534
 
7.4%
Washington 14537
 
3.9%
Georgia 13799
 
3.7%
Maryland 12190
 
3.3%
Mississippi 11760
 
3.2%
Ohio 10871
 
2.9%
Colorado 10802
 
2.9%
Illinois 10607
 
2.8%
Other values (43) 150709
40.4%

Length

2023-06-15T17:25:39.264158image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
florida 60971
 
14.7%
texas 49135
 
11.8%
california 27534
 
6.6%
new 22951
 
5.5%
carolina 17989
 
4.3%
washington 14539
 
3.5%
georgia 13799
 
3.3%
maryland 12190
 
2.9%
mississippi 11760
 
2.8%
ohio 10871
 
2.6%
Other values (46) 173571
41.8%

Most occurring characters

ValueCountFrequency (%)
a 391970
13.0%
i 334912
 
11.1%
o 265236
 
8.8%
n 224158
 
7.5%
s 205032
 
6.8%
e 198066
 
6.6%
r 198033
 
6.6%
l 170829
 
5.7%
d 98820
 
3.3%
t 65643
 
2.2%
Other values (36) 851319
28.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2546311
84.8%
Uppercase Letter 415312
 
13.8%
Space Separator 42395
 
1.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 391970
15.4%
i 334912
13.2%
o 265236
10.4%
n 224158
8.8%
s 205032
8.1%
e 198066
7.8%
r 198033
7.8%
l 170829
6.7%
d 98820
 
3.9%
t 65643
 
2.6%
Other values (14) 393612
15.5%
Uppercase Letter
ValueCountFrequency (%)
C 63660
15.3%
F 60971
14.7%
T 58525
14.1%
M 47946
11.5%
N 40511
9.8%
A 21769
 
5.2%
W 19097
 
4.6%
I 17264
 
4.2%
O 13812
 
3.3%
G 13799
 
3.3%
Other values (11) 57958
14.0%
Space Separator
ValueCountFrequency (%)
42395
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2961623
98.6%
Common 42395
 
1.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 391970
13.2%
i 334912
11.3%
o 265236
 
9.0%
n 224158
 
7.6%
s 205032
 
6.9%
e 198066
 
6.7%
r 198033
 
6.7%
l 170829
 
5.8%
d 98820
 
3.3%
t 65643
 
2.2%
Other values (35) 808924
27.3%
Common
ValueCountFrequency (%)
42395
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3004018
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 391970
13.0%
i 334912
 
11.1%
o 265236
 
8.8%
n 224158
 
7.5%
s 205032
 
6.8%
e 198066
 
6.6%
r 198033
 
6.6%
l 170829
 
5.7%
d 98820
 
3.3%
t 65643
 
2.2%
Other values (36) 851319
28.3%

ProductGroup
Categorical

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.8 MiB
TEX
91214 
BL
76398 
TTT
75115 
WL
66067 
SSL
40854 

Length

Max length3
Median length3
Mean length2.555577
Min length2

Characters and Unicode

Total characters953013
Distinct characters9
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTTT
2nd rowWL
3rd rowTTT
4th rowWL
5th rowTTT

Common Values

ValueCountFrequency (%)
TEX 91214
24.5%
BL 76398
20.5%
TTT 75115
20.1%
WL 66067
17.7%
SSL 40854
11.0%
MG 23267
 
6.2%

Length

2023-06-15T17:25:39.350193image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:39.454540image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
tex 91214
24.5%
bl 76398
20.5%
ttt 75115
20.1%
wl 66067
17.7%
ssl 40854
11.0%
mg 23267
 
6.2%

Most occurring characters

ValueCountFrequency (%)
T 316559
33.2%
L 183319
19.2%
E 91214
 
9.6%
X 91214
 
9.6%
S 81708
 
8.6%
B 76398
 
8.0%
W 66067
 
6.9%
M 23267
 
2.4%
G 23267
 
2.4%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 953013
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
T 316559
33.2%
L 183319
19.2%
E 91214
 
9.6%
X 91214
 
9.6%
S 81708
 
8.6%
B 76398
 
8.0%
W 66067
 
6.9%
M 23267
 
2.4%
G 23267
 
2.4%

Most occurring scripts

ValueCountFrequency (%)
Latin 953013
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
T 316559
33.2%
L 183319
19.2%
E 91214
 
9.6%
X 91214
 
9.6%
S 81708
 
8.6%
B 76398
 
8.0%
W 66067
 
6.9%
M 23267
 
2.4%
G 23267
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 953013
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
T 316559
33.2%
L 183319
19.2%
E 91214
 
9.6%
X 91214
 
9.6%
S 81708
 
8.6%
B 76398
 
8.0%
W 66067
 
6.9%
M 23267
 
2.4%
G 23267
 
2.4%

ProductGroupDesc
Categorical

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size2.8 MiB
Track Excavators
91214 
Backhoe Loaders
76398 
Track Type Tractors
75115 
Wheel Loader
66067 
Skid Steer Loaders
40854 

Length

Max length19
Median length16
Mean length15.722687
Min length12

Characters and Unicode

Total characters5863226
Distinct characters25
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTrack Type Tractors
2nd rowWheel Loader
3rd rowTrack Type Tractors
4th rowWheel Loader
5th rowTrack Type Tractors

Common Values

ValueCountFrequency (%)
Track Excavators 91214
24.5%
Backhoe Loaders 76398
20.5%
Track Type Tractors 75115
20.1%
Wheel Loader 66067
17.7%
Skid Steer Loaders 40854
11.0%
Motor Graders 23267
 
6.2%

Length

2023-06-15T17:25:39.559504image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:39.667656image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
track 166329
19.3%
loaders 117252
13.6%
excavators 91214
10.6%
backhoe 76398
8.9%
type 75115
8.7%
tractors 75115
8.7%
wheel 66067
 
7.7%
loader 66067
 
7.7%
skid 40854
 
4.7%
steer 40854
 
4.7%
Other values (2) 46534
 
5.4%

Most occurring characters

ValueCountFrequency (%)
a 706856
12.1%
r 701747
12.0%
e 571941
9.8%
488884
 
8.3%
o 472580
 
8.1%
c 409056
 
7.0%
T 316559
 
5.4%
s 306848
 
5.2%
k 283581
 
4.8%
d 247440
 
4.2%
Other values (15) 1357734
23.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 4512543
77.0%
Uppercase Letter 861799
 
14.7%
Space Separator 488884
 
8.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 706856
15.7%
r 701747
15.6%
e 571941
12.7%
o 472580
10.5%
c 409056
9.1%
s 306848
6.8%
k 283581
6.3%
d 247440
 
5.5%
t 230450
 
5.1%
h 142465
 
3.2%
Other values (6) 439579
9.7%
Uppercase Letter
ValueCountFrequency (%)
T 316559
36.7%
L 183319
21.3%
E 91214
 
10.6%
S 81708
 
9.5%
B 76398
 
8.9%
W 66067
 
7.7%
M 23267
 
2.7%
G 23267
 
2.7%
Space Separator
ValueCountFrequency (%)
488884
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 5374342
91.7%
Common 488884
 
8.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 706856
13.2%
r 701747
13.1%
e 571941
10.6%
o 472580
8.8%
c 409056
 
7.6%
T 316559
 
5.9%
s 306848
 
5.7%
k 283581
 
5.3%
d 247440
 
4.6%
t 230450
 
4.3%
Other values (14) 1127284
21.0%
Common
ValueCountFrequency (%)
488884
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5863226
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 706856
12.1%
r 701747
12.0%
e 571941
9.8%
488884
 
8.3%
o 472580
 
8.1%
c 409056
 
7.0%
T 316559
 
5.4%
s 306848
 
5.2%
k 283581
 
4.8%
d 247440
 
4.2%
Other values (15) 1357734
23.2%

Drive_System
Categorical

Distinct4
Distinct (%)< 0.1%
Missing273743
Missing (%)73.4%
Memory size2.8 MiB
Two Wheel Drive
44120 
Four Wheel Drive
32002 
No
22341 
All Wheel Drive
 
709

Length

Max length16
Median length15
Mean length12.394113
Min length2

Characters and Unicode

Total characters1229149
Distinct characters16
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowFour Wheel Drive
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
Two Wheel Drive 44120
 
11.8%
Four Wheel Drive 32002
 
8.6%
No 22341
 
6.0%
All Wheel Drive 709
 
0.2%
(Missing) 273743
73.4%

Length

2023-06-15T17:25:39.771939image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:39.862774image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
wheel 76831
30.4%
drive 76831
30.4%
two 44120
17.5%
four 32002
12.7%
no 22341
 
8.8%
all 709
 
0.3%

Most occurring characters

ValueCountFrequency (%)
e 230493
18.8%
153662
12.5%
r 108833
8.9%
o 98463
8.0%
l 78249
 
6.4%
W 76831
 
6.3%
h 76831
 
6.3%
D 76831
 
6.3%
i 76831
 
6.3%
v 76831
 
6.3%
Other values (6) 175294
14.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 822653
66.9%
Uppercase Letter 252834
 
20.6%
Space Separator 153662
 
12.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 230493
28.0%
r 108833
13.2%
o 98463
12.0%
l 78249
 
9.5%
h 76831
 
9.3%
i 76831
 
9.3%
v 76831
 
9.3%
w 44120
 
5.4%
u 32002
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
W 76831
30.4%
D 76831
30.4%
T 44120
17.5%
F 32002
12.7%
N 22341
 
8.8%
A 709
 
0.3%
Space Separator
ValueCountFrequency (%)
153662
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1075487
87.5%
Common 153662
 
12.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 230493
21.4%
r 108833
10.1%
o 98463
9.2%
l 78249
 
7.3%
W 76831
 
7.1%
h 76831
 
7.1%
D 76831
 
7.1%
i 76831
 
7.1%
v 76831
 
7.1%
T 44120
 
4.1%
Other values (5) 131174
12.2%
Common
ValueCountFrequency (%)
153662
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1229149
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 230493
18.8%
153662
12.5%
r 108833
8.9%
o 98463
8.0%
l 78249
 
6.4%
W 76831
 
6.3%
h 76831
 
6.3%
D 76831
 
6.3%
i 76831
 
6.3%
v 76831
 
6.3%
Other values (6) 175294
14.3%

Enclosure
Categorical

Distinct6
Distinct (%)< 0.1%
Missing262
Missing (%)0.1%
Memory size2.8 MiB
OROPS
162358 
EROPS
123878 
EROPS w AC
86397 
EROPS AC
 
16
NO ROPS
 
2

Length

Max length19
Median length5
Mean length6.1594298
Min length5

Characters and Unicode

Total characters2295330
Distinct characters21
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowOROPS
2nd rowOROPS
3rd rowOROPS
4th rowEROPS
5th rowOROPS

Common Values

ValueCountFrequency (%)
OROPS 162358
43.5%
EROPS 123878
33.2%
EROPS w AC 86397
23.2%
EROPS AC 16
 
< 0.1%
NO ROPS 2
 
< 0.1%
None or Unspecified 2
 
< 0.1%
(Missing) 262
 
0.1%

Length

2023-06-15T17:25:39.950210image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:40.052844image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
erops 210291
38.6%
orops 162358
29.8%
ac 86413
15.8%
w 86397
15.8%
no 2
 
< 0.1%
rops 2
 
< 0.1%
none 2
 
< 0.1%
or 2
 
< 0.1%
unspecified 2
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
O 535011
23.3%
P 372651
16.2%
S 372651
16.2%
R 372651
16.2%
E 210291
 
9.2%
172816
 
7.5%
A 86413
 
3.8%
C 86413
 
3.8%
w 86397
 
3.8%
e 6
 
< 0.1%
Other values (11) 30
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 2036087
88.7%
Space Separator 172816
 
7.5%
Lowercase Letter 86427
 
3.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 86397
> 99.9%
e 6
 
< 0.1%
i 4
 
< 0.1%
o 4
 
< 0.1%
n 4
 
< 0.1%
r 2
 
< 0.1%
s 2
 
< 0.1%
p 2
 
< 0.1%
c 2
 
< 0.1%
f 2
 
< 0.1%
Uppercase Letter
ValueCountFrequency (%)
O 535011
26.3%
P 372651
18.3%
S 372651
18.3%
R 372651
18.3%
E 210291
 
10.3%
A 86413
 
4.2%
C 86413
 
4.2%
N 4
 
< 0.1%
U 2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
172816
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2122514
92.5%
Common 172816
 
7.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
O 535011
25.2%
P 372651
17.6%
S 372651
17.6%
R 372651
17.6%
E 210291
 
9.9%
A 86413
 
4.1%
C 86413
 
4.1%
w 86397
 
4.1%
e 6
 
< 0.1%
i 4
 
< 0.1%
Other values (10) 26
 
< 0.1%
Common
ValueCountFrequency (%)
172816
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2295330
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
O 535011
23.3%
P 372651
16.2%
S 372651
16.2%
R 372651
16.2%
E 210291
 
9.2%
172816
 
7.5%
A 86413
 
3.8%
C 86413
 
3.8%
w 86397
 
3.8%
e 6
 
< 0.1%
Other values (11) 30
 
< 0.1%

Forks
Categorical

IMBALANCE  MISSING 

Distinct2
Distinct (%)< 0.1%
Missing191074
Missing (%)51.2%
Memory size2.8 MiB
None or Unspecified
168050 
Yes
 
13791

Length

Max length19
Median length19
Mean length17.786544
Min length3

Characters and Unicode

Total characters3234323
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowYes
4th rowYes
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 168050
45.1%
Yes 13791
 
3.7%
(Missing) 191074
51.2%

Length

2023-06-15T17:25:40.149639image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:40.242163image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 168050
32.4%
or 168050
32.4%
unspecified 168050
32.4%
yes 13791
 
2.7%

Most occurring characters

ValueCountFrequency (%)
e 517941
16.0%
o 336100
10.4%
n 336100
10.4%
336100
10.4%
i 336100
10.4%
s 181841
 
5.6%
N 168050
 
5.2%
r 168050
 
5.2%
U 168050
 
5.2%
p 168050
 
5.2%
Other values (4) 517941
16.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2548332
78.8%
Uppercase Letter 349891
 
10.8%
Space Separator 336100
 
10.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 517941
20.3%
o 336100
13.2%
n 336100
13.2%
i 336100
13.2%
s 181841
 
7.1%
r 168050
 
6.6%
p 168050
 
6.6%
c 168050
 
6.6%
f 168050
 
6.6%
d 168050
 
6.6%
Uppercase Letter
ValueCountFrequency (%)
N 168050
48.0%
U 168050
48.0%
Y 13791
 
3.9%
Space Separator
ValueCountFrequency (%)
336100
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2898223
89.6%
Common 336100
 
10.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 517941
17.9%
o 336100
11.6%
n 336100
11.6%
i 336100
11.6%
s 181841
 
6.3%
N 168050
 
5.8%
r 168050
 
5.8%
U 168050
 
5.8%
p 168050
 
5.8%
c 168050
 
5.8%
Other values (3) 349891
12.1%
Common
ValueCountFrequency (%)
336100
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3234323
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 517941
16.0%
o 336100
10.4%
n 336100
10.4%
336100
10.4%
i 336100
10.4%
s 181841
 
5.6%
N 168050
 
5.2%
r 168050
 
5.2%
U 168050
 
5.2%
p 168050
 
5.2%
Other values (4) 517941
16.0%

Pad_Type
Categorical

IMBALANCE  MISSING 

Distinct4
Distinct (%)< 0.1%
Missing296794
Missing (%)79.6%
Memory size2.8 MiB
None or Unspecified
67651 
Reversible
 
5800
Street
 
2645
Grouser
 
25

Length

Max length19
Median length19
Mean length17.858594
Min length6

Characters and Unicode

Total characters1359414
Distinct characters21
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 67651
 
18.1%
Reversible 5800
 
1.6%
Street 2645
 
0.7%
Grouser 25
 
< 0.1%
(Missing) 296794
79.6%

Length

2023-06-15T17:25:40.321053image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:40.440284image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 67651
32.0%
or 67651
32.0%
unspecified 67651
32.0%
reversible 5800
 
2.7%
street 2645
 
1.3%
grouser 25
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
e 225668
16.6%
i 141102
10.4%
o 135327
10.0%
n 135302
10.0%
135302
10.0%
r 76146
 
5.6%
s 73476
 
5.4%
d 67651
 
5.0%
f 67651
 
5.0%
N 67651
 
5.0%
Other values (11) 234138
17.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1080340
79.5%
Uppercase Letter 143772
 
10.6%
Space Separator 135302
 
10.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 225668
20.9%
i 141102
13.1%
o 135327
12.5%
n 135302
12.5%
r 76146
 
7.0%
s 73476
 
6.8%
d 67651
 
6.3%
f 67651
 
6.3%
c 67651
 
6.3%
p 67651
 
6.3%
Other values (5) 22715
 
2.1%
Uppercase Letter
ValueCountFrequency (%)
N 67651
47.1%
U 67651
47.1%
R 5800
 
4.0%
S 2645
 
1.8%
G 25
 
< 0.1%
Space Separator
ValueCountFrequency (%)
135302
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1224112
90.0%
Common 135302
 
10.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 225668
18.4%
i 141102
11.5%
o 135327
11.1%
n 135302
11.1%
r 76146
 
6.2%
s 73476
 
6.0%
d 67651
 
5.5%
f 67651
 
5.5%
N 67651
 
5.5%
c 67651
 
5.5%
Other values (10) 166487
13.6%
Common
ValueCountFrequency (%)
135302
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1359414
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 225668
16.6%
i 141102
10.4%
o 135327
10.0%
n 135302
10.0%
135302
10.0%
r 76146
 
5.6%
s 73476
 
5.4%
d 67651
 
5.0%
f 67651
 
5.0%
N 67651
 
5.0%
Other values (11) 234138
17.2%

Ride_Control
Categorical

Distinct3
Distinct (%)< 0.1%
Missing231900
Missing (%)62.2%
Memory size2.8 MiB
No
74438 
None or Unspecified
58219 
Yes
8358 

Length

Max length19
Median length2
Mean length9.0778357
Min length2

Characters and Unicode

Total characters1280111
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowNo

Common Values

ValueCountFrequency (%)
No 74438
 
20.0%
None or Unspecified 58219
 
15.6%
Yes 8358
 
2.2%
(Missing) 231900
62.2%

Length

2023-06-15T17:25:40.531916image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:40.623740image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
no 74438
28.9%
none 58219
22.6%
or 58219
22.6%
unspecified 58219
22.6%
yes 8358
 
3.2%

Most occurring characters

ValueCountFrequency (%)
o 190876
14.9%
e 183015
14.3%
N 132657
10.4%
n 116438
9.1%
116438
9.1%
i 116438
9.1%
s 66577
 
5.2%
r 58219
 
4.5%
U 58219
 
4.5%
p 58219
 
4.5%
Other values (4) 183015
14.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 964439
75.3%
Uppercase Letter 199234
 
15.6%
Space Separator 116438
 
9.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 190876
19.8%
e 183015
19.0%
n 116438
12.1%
i 116438
12.1%
s 66577
 
6.9%
r 58219
 
6.0%
p 58219
 
6.0%
c 58219
 
6.0%
f 58219
 
6.0%
d 58219
 
6.0%
Uppercase Letter
ValueCountFrequency (%)
N 132657
66.6%
U 58219
29.2%
Y 8358
 
4.2%
Space Separator
ValueCountFrequency (%)
116438
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1163673
90.9%
Common 116438
 
9.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 190876
16.4%
e 183015
15.7%
N 132657
11.4%
n 116438
10.0%
i 116438
10.0%
s 66577
 
5.7%
r 58219
 
5.0%
U 58219
 
5.0%
p 58219
 
5.0%
c 58219
 
5.0%
Other values (3) 124796
10.7%
Common
ValueCountFrequency (%)
116438
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1280111
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 190876
14.9%
e 183015
14.3%
N 132657
10.4%
n 116438
9.1%
116438
9.1%
i 116438
9.1%
s 66577
 
5.2%
r 58219
 
4.5%
U 58219
 
4.5%
p 58219
 
4.5%
Other values (4) 183015
14.3%

Stick
Categorical

Distinct2
Distinct (%)< 0.1%
Missing296794
Missing (%)79.6%
Memory size2.8 MiB
Standard
46491 
Extended
29630 

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters608968
Distinct characters9
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowStandard
2nd rowStandard
3rd rowStandard
4th rowStandard
5th rowStandard

Common Values

ValueCountFrequency (%)
Standard 46491
 
12.5%
Extended 29630
 
7.9%
(Missing) 296794
79.6%

Length

2023-06-15T17:25:40.703546image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:40.789853image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
standard 46491
61.1%
extended 29630
38.9%

Most occurring characters

ValueCountFrequency (%)
d 152242
25.0%
a 92982
15.3%
t 76121
12.5%
n 76121
12.5%
e 59260
 
9.7%
S 46491
 
7.6%
r 46491
 
7.6%
E 29630
 
4.9%
x 29630
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 532847
87.5%
Uppercase Letter 76121
 
12.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
d 152242
28.6%
a 92982
17.5%
t 76121
14.3%
n 76121
14.3%
e 59260
 
11.1%
r 46491
 
8.7%
x 29630
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
S 46491
61.1%
E 29630
38.9%

Most occurring scripts

ValueCountFrequency (%)
Latin 608968
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
d 152242
25.0%
a 92982
15.3%
t 76121
12.5%
n 76121
12.5%
e 59260
 
9.7%
S 46491
 
7.6%
r 46491
 
7.6%
E 29630
 
4.9%
x 29630
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 608968
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
d 152242
25.0%
a 92982
15.3%
t 76121
12.5%
n 76121
12.5%
e 59260
 
9.7%
S 46491
 
7.6%
r 46491
 
7.6%
E 29630
 
4.9%
x 29630
 
4.9%

Transmission
Categorical

IMBALANCE  MISSING 

Distinct8
Distinct (%)< 0.1%
Missing199883
Missing (%)53.6%
Memory size2.8 MiB
Standard
134104 
None or Unspecified
21510 
Powershift
 
9867
Powershuttle
 
3947
Hydrostatic
 
3167
Other values (3)
 
437

Length

Max length19
Median length8
Mean length9.6350848
Min length8

Characters and Unicode

Total characters1667178
Distinct characters26
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowDirect Drive
2nd rowStandard
3rd rowStandard
4th rowStandard
5th rowPowershift

Common Values

ValueCountFrequency (%)
Standard 134104
36.0%
None or Unspecified 21510
 
5.8%
Powershift 9867
 
2.6%
Powershuttle 3947
 
1.1%
Hydrostatic 3167
 
0.8%
Direct Drive 284
 
0.1%
Autoshift 114
 
< 0.1%
AutoShift 39
 
< 0.1%
(Missing) 199883
53.6%

Length

2023-06-15T17:25:40.869840image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:40.975536image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
standard 134104
62.0%
none 21510
 
9.9%
or 21510
 
9.9%
unspecified 21510
 
9.9%
powershift 9867
 
4.6%
powershuttle 3947
 
1.8%
hydrostatic 3167
 
1.5%
direct 284
 
0.1%
drive 284
 
0.1%
autoshift 153
 
0.1%

Most occurring characters

ValueCountFrequency (%)
d 292885
17.6%
a 271375
16.3%
n 177124
10.6%
r 173163
10.4%
t 158789
9.5%
S 134143
8.0%
e 82859
 
5.0%
o 60154
 
3.6%
i 56775
 
3.4%
43304
 
2.6%
Other values (16) 216607
13.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1429009
85.7%
Uppercase Letter 194865
 
11.7%
Space Separator 43304
 
2.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
d 292885
20.5%
a 271375
19.0%
n 177124
12.4%
r 173163
12.1%
t 158789
11.1%
e 82859
 
5.8%
o 60154
 
4.2%
i 56775
 
4.0%
s 38605
 
2.7%
f 31530
 
2.2%
Other values (8) 85750
 
6.0%
Uppercase Letter
ValueCountFrequency (%)
S 134143
68.8%
U 21510
 
11.0%
N 21510
 
11.0%
P 13814
 
7.1%
H 3167
 
1.6%
D 568
 
0.3%
A 153
 
0.1%
Space Separator
ValueCountFrequency (%)
43304
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1623874
97.4%
Common 43304
 
2.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
d 292885
18.0%
a 271375
16.7%
n 177124
10.9%
r 173163
10.7%
t 158789
9.8%
S 134143
8.3%
e 82859
 
5.1%
o 60154
 
3.7%
i 56775
 
3.5%
s 38605
 
2.4%
Other values (15) 178002
11.0%
Common
ValueCountFrequency (%)
43304
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1667178
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
d 292885
17.6%
a 271375
16.3%
n 177124
10.6%
r 173163
10.4%
t 158789
9.5%
S 134143
8.0%
e 82859
 
5.0%
o 60154
 
3.6%
i 56775
 
3.4%
43304
 
2.6%
Other values (16) 216607
13.0%

Turbocharged
Categorical

IMBALANCE  MISSING 

Distinct2
Distinct (%)< 0.1%
Missing296794
Missing (%)79.6%
Memory size2.8 MiB
None or Unspecified
72254 
Yes
 
3867

Length

Max length19
Median length19
Mean length18.187189
Min length3

Characters and Unicode

Total characters1384427
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 72254
 
19.4%
Yes 3867
 
1.0%
(Missing) 296794
79.6%

Length

2023-06-15T17:25:41.081524image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:41.176055image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 72254
32.7%
or 72254
32.7%
unspecified 72254
32.7%
yes 3867
 
1.8%

Most occurring characters

ValueCountFrequency (%)
e 220629
15.9%
o 144508
10.4%
n 144508
10.4%
144508
10.4%
i 144508
10.4%
s 76121
 
5.5%
N 72254
 
5.2%
r 72254
 
5.2%
U 72254
 
5.2%
p 72254
 
5.2%
Other values (4) 220629
15.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1091544
78.8%
Uppercase Letter 148375
 
10.7%
Space Separator 144508
 
10.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 220629
20.2%
o 144508
13.2%
n 144508
13.2%
i 144508
13.2%
s 76121
 
7.0%
r 72254
 
6.6%
p 72254
 
6.6%
c 72254
 
6.6%
f 72254
 
6.6%
d 72254
 
6.6%
Uppercase Letter
ValueCountFrequency (%)
N 72254
48.7%
U 72254
48.7%
Y 3867
 
2.6%
Space Separator
ValueCountFrequency (%)
144508
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1239919
89.6%
Common 144508
 
10.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 220629
17.8%
o 144508
11.7%
n 144508
11.7%
i 144508
11.7%
s 76121
 
6.1%
N 72254
 
5.8%
r 72254
 
5.8%
U 72254
 
5.8%
p 72254
 
5.8%
c 72254
 
5.8%
Other values (3) 148375
12.0%
Common
ValueCountFrequency (%)
144508
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1384427
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 220629
15.9%
o 144508
10.4%
n 144508
10.4%
144508
10.4%
i 144508
10.4%
s 76121
 
5.5%
N 72254
 
5.2%
r 72254
 
5.2%
U 72254
 
5.2%
p 72254
 
5.2%
Other values (4) 220629
15.9%

Blade_Extension
Categorical

IMBALANCE  MISSING 

Distinct2
Distinct (%)< 0.1%
Missing349872
Missing (%)93.8%
Memory size2.8 MiB
None or Unspecified
22507 
Yes
 
536

Length

Max length19
Median length19
Mean length18.627826
Min length3

Characters and Unicode

Total characters429241
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 22507
 
6.0%
Yes 536
 
0.1%
(Missing) 349872
93.8%

Length

2023-06-15T17:25:41.253353image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:41.337659image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 22507
33.1%
or 22507
33.1%
unspecified 22507
33.1%
yes 536
 
0.8%

Most occurring characters

ValueCountFrequency (%)
e 68057
15.9%
o 45014
10.5%
n 45014
10.5%
45014
10.5%
i 45014
10.5%
s 23043
 
5.4%
N 22507
 
5.2%
r 22507
 
5.2%
U 22507
 
5.2%
p 22507
 
5.2%
Other values (4) 68057
15.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 338677
78.9%
Uppercase Letter 45550
 
10.6%
Space Separator 45014
 
10.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 68057
20.1%
o 45014
13.3%
n 45014
13.3%
i 45014
13.3%
s 23043
 
6.8%
r 22507
 
6.6%
p 22507
 
6.6%
c 22507
 
6.6%
f 22507
 
6.6%
d 22507
 
6.6%
Uppercase Letter
ValueCountFrequency (%)
N 22507
49.4%
U 22507
49.4%
Y 536
 
1.2%
Space Separator
ValueCountFrequency (%)
45014
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 384227
89.5%
Common 45014
 
10.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 68057
17.7%
o 45014
11.7%
n 45014
11.7%
i 45014
11.7%
s 23043
 
6.0%
N 22507
 
5.9%
r 22507
 
5.9%
U 22507
 
5.9%
p 22507
 
5.9%
c 22507
 
5.9%
Other values (3) 45550
11.9%
Common
ValueCountFrequency (%)
45014
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 429241
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 68057
15.9%
o 45014
10.5%
n 45014
10.5%
45014
10.5%
i 45014
10.5%
s 23043
 
5.4%
N 22507
 
5.2%
r 22507
 
5.2%
U 22507
 
5.2%
p 22507
 
5.2%
Other values (4) 68057
15.9%

Blade_Width
Categorical

Distinct6
Distinct (%)< 0.1%
Missing349872
Missing (%)93.8%
Memory size2.8 MiB
14'
9042 
None or Unspecified
8240 
12'
4532 
16'
 
885
13'
 
281

Length

Max length19
Median length3
Mean length8.7242113
Min length3

Characters and Unicode

Total characters201032
Distinct characters20
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
14' 9042
 
2.4%
None or Unspecified 8240
 
2.2%
12' 4532
 
1.2%
16' 885
 
0.2%
13' 281
 
0.1%
<12' 63
 
< 0.1%
(Missing) 349872
93.8%

Length

2023-06-15T17:25:41.412624image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:41.512015image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
14 9042
22.9%
none 8240
20.8%
or 8240
20.8%
unspecified 8240
20.8%
12 4595
11.6%
16 885
 
2.2%
13 281
 
0.7%

Most occurring characters

ValueCountFrequency (%)
e 24720
12.3%
o 16480
 
8.2%
n 16480
 
8.2%
16480
 
8.2%
i 16480
 
8.2%
1 14803
 
7.4%
' 14803
 
7.4%
4 9042
 
4.5%
c 8240
 
4.1%
d 8240
 
4.1%
Other values (10) 55264
27.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 123600
61.5%
Decimal Number 29606
 
14.7%
Space Separator 16480
 
8.2%
Uppercase Letter 16480
 
8.2%
Other Punctuation 14803
 
7.4%
Math Symbol 63
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 24720
20.0%
o 16480
13.3%
n 16480
13.3%
i 16480
13.3%
c 8240
 
6.7%
d 8240
 
6.7%
f 8240
 
6.7%
s 8240
 
6.7%
p 8240
 
6.7%
r 8240
 
6.7%
Decimal Number
ValueCountFrequency (%)
1 14803
50.0%
4 9042
30.5%
2 4595
 
15.5%
6 885
 
3.0%
3 281
 
0.9%
Uppercase Letter
ValueCountFrequency (%)
U 8240
50.0%
N 8240
50.0%
Space Separator
ValueCountFrequency (%)
16480
100.0%
Other Punctuation
ValueCountFrequency (%)
' 14803
100.0%
Math Symbol
ValueCountFrequency (%)
< 63
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 140080
69.7%
Common 60952
30.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 24720
17.6%
o 16480
11.8%
n 16480
11.8%
i 16480
11.8%
c 8240
 
5.9%
d 8240
 
5.9%
f 8240
 
5.9%
s 8240
 
5.9%
p 8240
 
5.9%
U 8240
 
5.9%
Other values (2) 16480
11.8%
Common
ValueCountFrequency (%)
16480
27.0%
1 14803
24.3%
' 14803
24.3%
4 9042
14.8%
2 4595
 
7.5%
6 885
 
1.5%
3 281
 
0.5%
< 63
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 201032
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 24720
12.3%
o 16480
 
8.2%
n 16480
 
8.2%
16480
 
8.2%
i 16480
 
8.2%
1 14803
 
7.4%
' 14803
 
7.4%
4 9042
 
4.5%
c 8240
 
4.1%
d 8240
 
4.1%
Other values (10) 55264
27.5%

Enclosure_Type
Categorical

IMBALANCE  MISSING 

Distinct3
Distinct (%)< 0.1%
Missing349872
Missing (%)93.8%
Memory size2.8 MiB
None or Unspecified
19682 
Low Profile
2567 
High Profile
 
794

Length

Max length19
Median length19
Mean length17.867595
Min length11

Characters and Unicode

Total characters411723
Distinct characters20
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 19682
 
5.3%
Low Profile 2567
 
0.7%
High Profile 794
 
0.2%
(Missing) 349872
93.8%

Length

2023-06-15T17:25:41.603531image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:41.698162image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 19682
29.9%
or 19682
29.9%
unspecified 19682
29.9%
profile 3361
 
5.1%
low 2567
 
3.9%
high 794
 
1.2%

Most occurring characters

ValueCountFrequency (%)
e 62407
15.2%
o 45292
11.0%
i 43519
10.6%
42725
10.4%
n 39364
9.6%
r 23043
 
5.6%
f 23043
 
5.6%
c 19682
 
4.8%
d 19682
 
4.8%
N 19682
 
4.8%
Other values (10) 73284
17.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 322912
78.4%
Uppercase Letter 46086
 
11.2%
Space Separator 42725
 
10.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 62407
19.3%
o 45292
14.0%
i 43519
13.5%
n 39364
12.2%
r 23043
 
7.1%
f 23043
 
7.1%
c 19682
 
6.1%
d 19682
 
6.1%
p 19682
 
6.1%
s 19682
 
6.1%
Other values (4) 7516
 
2.3%
Uppercase Letter
ValueCountFrequency (%)
N 19682
42.7%
U 19682
42.7%
P 3361
 
7.3%
L 2567
 
5.6%
H 794
 
1.7%
Space Separator
ValueCountFrequency (%)
42725
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 368998
89.6%
Common 42725
 
10.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 62407
16.9%
o 45292
12.3%
i 43519
11.8%
n 39364
10.7%
r 23043
 
6.2%
f 23043
 
6.2%
c 19682
 
5.3%
d 19682
 
5.3%
N 19682
 
5.3%
p 19682
 
5.3%
Other values (9) 53602
14.5%
Common
ValueCountFrequency (%)
42725
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 411723
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 62407
15.2%
o 45292
11.0%
i 43519
10.6%
42725
10.4%
n 39364
9.6%
r 23043
 
5.6%
f 23043
 
5.6%
c 19682
 
4.8%
d 19682
 
4.8%
N 19682
 
4.8%
Other values (10) 73284
17.8%

Engine_Horsepower
Categorical

IMBALANCE  MISSING 

Distinct2
Distinct (%)< 0.1%
Missing349872
Missing (%)93.8%
Memory size2.8 MiB
No
21754 
Variable
 
1289

Length

Max length8
Median length2
Mean length2.3356334
Min length2

Characters and Unicode

Total characters53820
Distinct characters9
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNo
2nd rowNo
3rd rowNo
4th rowNo
5th rowNo

Common Values

ValueCountFrequency (%)
No 21754
 
5.8%
Variable 1289
 
0.3%
(Missing) 349872
93.8%

Length

2023-06-15T17:25:41.775187image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:41.855307image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
no 21754
94.4%
variable 1289
 
5.6%

Most occurring characters

ValueCountFrequency (%)
N 21754
40.4%
o 21754
40.4%
a 2578
 
4.8%
V 1289
 
2.4%
r 1289
 
2.4%
i 1289
 
2.4%
b 1289
 
2.4%
l 1289
 
2.4%
e 1289
 
2.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 30777
57.2%
Uppercase Letter 23043
42.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 21754
70.7%
a 2578
 
8.4%
r 1289
 
4.2%
i 1289
 
4.2%
b 1289
 
4.2%
l 1289
 
4.2%
e 1289
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
N 21754
94.4%
V 1289
 
5.6%

Most occurring scripts

ValueCountFrequency (%)
Latin 53820
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
N 21754
40.4%
o 21754
40.4%
a 2578
 
4.8%
V 1289
 
2.4%
r 1289
 
2.4%
i 1289
 
2.4%
b 1289
 
2.4%
l 1289
 
2.4%
e 1289
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 53820
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N 21754
40.4%
o 21754
40.4%
a 2578
 
4.8%
V 1289
 
2.4%
r 1289
 
2.4%
i 1289
 
2.4%
b 1289
 
2.4%
l 1289
 
2.4%
e 1289
 
2.4%

Hydraulics
Categorical

Distinct12
Distinct (%)< 0.1%
Missing77373
Missing (%)20.7%
Memory size2.8 MiB
2 Valve
131417 
Standard
92283 
Auxiliary
40136 
Base + 1 Function
22563 
3 Valve
 
5486
Other values (7)
 
3657

Length

Max length19
Median length17
Mean length8.3691963
Min length7

Characters and Unicode

Total characters2473449
Distinct characters32
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2 Valve
2nd row2 Valve
3rd row2 Valve
4th row2 Valve
5th row2 Valve

Common Values

ValueCountFrequency (%)
2 Valve 131417
35.2%
Standard 92283
24.7%
Auxiliary 40136
 
10.8%
Base + 1 Function 22563
 
6.1%
3 Valve 5486
 
1.5%
4 Valve 3011
 
0.8%
Base + 3 Function 298
 
0.1%
Base + 2 Function 127
 
< 0.1%
Base + 5 Function 90
 
< 0.1%
Base + 4 Function 77
 
< 0.1%
Other values (2) 54
 
< 0.1%
(Missing) 77373
20.7%

Length

2023-06-15T17:25:41.924861image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
valve 139914
27.7%
2 131544
26.0%
standard 92283
18.3%
auxiliary 40136
 
7.9%
base 23204
 
4.6%
23204
 
4.6%
function 23204
 
4.6%
1 22563
 
4.5%
3 5784
 
1.1%
4 3088
 
0.6%
Other values (5) 154
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
a 387820
15.7%
209536
 
8.5%
d 184571
 
7.5%
l 180050
 
7.3%
e 163133
 
6.6%
V 139914
 
5.7%
v 139914
 
5.7%
n 138701
 
5.6%
r 132424
 
5.4%
2 131544
 
5.3%
Other values (22) 665842
26.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1758840
71.1%
Uppercase Letter 318751
 
12.9%
Space Separator 209536
 
8.5%
Decimal Number 163118
 
6.6%
Math Symbol 23204
 
0.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 387820
22.0%
d 184571
10.5%
l 180050
10.2%
e 163133
9.3%
v 139914
 
8.0%
n 138701
 
7.9%
r 132424
 
7.5%
t 115487
 
6.6%
i 103486
 
5.9%
u 63340
 
3.6%
Other values (7) 149914
 
8.5%
Uppercase Letter
ValueCountFrequency (%)
V 139914
43.9%
S 92283
29.0%
A 40136
 
12.6%
F 23204
 
7.3%
B 23204
 
7.3%
N 5
 
< 0.1%
U 5
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 131544
80.6%
1 22563
 
13.8%
3 5784
 
3.5%
4 3088
 
1.9%
5 90
 
0.1%
6 49
 
< 0.1%
Space Separator
ValueCountFrequency (%)
209536
100.0%
Math Symbol
ValueCountFrequency (%)
+ 23204
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2077591
84.0%
Common 395858
 
16.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 387820
18.7%
d 184571
8.9%
l 180050
8.7%
e 163133
7.9%
V 139914
 
6.7%
v 139914
 
6.7%
n 138701
 
6.7%
r 132424
 
6.4%
t 115487
 
5.6%
i 103486
 
5.0%
Other values (14) 392091
18.9%
Common
ValueCountFrequency (%)
209536
52.9%
2 131544
33.2%
+ 23204
 
5.9%
1 22563
 
5.7%
3 5784
 
1.5%
4 3088
 
0.8%
5 90
 
< 0.1%
6 49
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2473449
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 387820
15.7%
209536
 
8.5%
d 184571
 
7.5%
l 180050
 
7.3%
e 163133
 
6.6%
V 139914
 
5.7%
v 139914
 
5.7%
n 138701
 
5.6%
r 132424
 
5.4%
2 131544
 
5.3%
Other values (22) 665842
26.9%

Pushblock
Categorical

Distinct2
Distinct (%)< 0.1%
Missing349872
Missing (%)93.8%
Memory size2.8 MiB
None or Unspecified
17466 
Yes
5577 

Length

Max length19
Median length19
Mean length15.127588
Min length3

Characters and Unicode

Total characters348585
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowYes

Common Values

ValueCountFrequency (%)
None or Unspecified 17466
 
4.7%
Yes 5577
 
1.5%
(Missing) 349872
93.8%

Length

2023-06-15T17:25:42.012679image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:42.099542image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 17466
30.1%
or 17466
30.1%
unspecified 17466
30.1%
yes 5577
 
9.6%

Most occurring characters

ValueCountFrequency (%)
e 57975
16.6%
o 34932
10.0%
n 34932
10.0%
34932
10.0%
i 34932
10.0%
s 23043
 
6.6%
N 17466
 
5.0%
r 17466
 
5.0%
U 17466
 
5.0%
p 17466
 
5.0%
Other values (4) 57975
16.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 273144
78.4%
Uppercase Letter 40509
 
11.6%
Space Separator 34932
 
10.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 57975
21.2%
o 34932
12.8%
n 34932
12.8%
i 34932
12.8%
s 23043
 
8.4%
r 17466
 
6.4%
p 17466
 
6.4%
c 17466
 
6.4%
f 17466
 
6.4%
d 17466
 
6.4%
Uppercase Letter
ValueCountFrequency (%)
N 17466
43.1%
U 17466
43.1%
Y 5577
 
13.8%
Space Separator
ValueCountFrequency (%)
34932
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 313653
90.0%
Common 34932
 
10.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 57975
18.5%
o 34932
11.1%
n 34932
11.1%
i 34932
11.1%
s 23043
 
7.3%
N 17466
 
5.6%
r 17466
 
5.6%
U 17466
 
5.6%
p 17466
 
5.6%
c 17466
 
5.6%
Other values (3) 40509
12.9%
Common
ValueCountFrequency (%)
34932
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 348585
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 57975
16.6%
o 34932
10.0%
n 34932
10.0%
34932
10.0%
i 34932
10.0%
s 23043
 
6.6%
N 17466
 
5.0%
r 17466
 
5.0%
U 17466
 
5.0%
p 17466
 
5.0%
Other values (4) 57975
16.6%

Ripper
Categorical

Distinct4
Distinct (%)< 0.1%
Missing275994
Missing (%)74.0%
Memory size2.8 MiB
None or Unspecified
76851 
Multi Shank
 
7659
Yes
 
7572
Single Shank
 
4839

Length

Max length19
Median length19
Mean length16.768316
Min length3

Characters and Unicode

Total characters1625202
Distinct characters23
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 76851
 
20.6%
Multi Shank 7659
 
2.1%
Yes 7572
 
2.0%
Single Shank 4839
 
1.3%
(Missing) 275994
74.0%

Length

2023-06-15T17:25:42.173925image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:42.265880image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 76851
29.2%
or 76851
29.2%
unspecified 76851
29.2%
shank 12498
 
4.7%
multi 7659
 
2.9%
yes 7572
 
2.9%
single 4839
 
1.8%

Most occurring characters

ValueCountFrequency (%)
e 242964
14.9%
n 171039
10.5%
166200
10.2%
i 166200
10.2%
o 153702
9.5%
s 84423
 
5.2%
N 76851
 
4.7%
c 76851
 
4.7%
d 76851
 
4.7%
f 76851
 
4.7%
Other values (13) 333270
20.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1272732
78.3%
Uppercase Letter 186270
 
11.5%
Space Separator 166200
 
10.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 242964
19.1%
n 171039
13.4%
i 166200
13.1%
o 153702
12.1%
s 84423
 
6.6%
c 76851
 
6.0%
d 76851
 
6.0%
f 76851
 
6.0%
p 76851
 
6.0%
r 76851
 
6.0%
Other values (7) 70149
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
N 76851
41.3%
U 76851
41.3%
S 17337
 
9.3%
M 7659
 
4.1%
Y 7572
 
4.1%
Space Separator
ValueCountFrequency (%)
166200
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1459002
89.8%
Common 166200
 
10.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 242964
16.7%
n 171039
11.7%
i 166200
11.4%
o 153702
10.5%
s 84423
 
5.8%
N 76851
 
5.3%
c 76851
 
5.3%
d 76851
 
5.3%
f 76851
 
5.3%
p 76851
 
5.3%
Other values (12) 256419
17.6%
Common
ValueCountFrequency (%)
166200
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1625202
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 242964
14.9%
n 171039
10.5%
166200
10.2%
i 166200
10.2%
o 153702
9.5%
s 84423
 
5.2%
N 76851
 
4.7%
c 76851
 
4.7%
d 76851
 
4.7%
f 76851
 
4.7%
Other values (13) 333270
20.5%

Scarifier
Categorical

Distinct2
Distinct (%)< 0.1%
Missing349861
Missing (%)93.8%
Memory size2.8 MiB
None or Unspecified
11529 
Yes
11525 

Length

Max length19
Median length19
Mean length11.001388
Min length3

Characters and Unicode

Total characters253626
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowYes
2nd rowYes
3rd rowYes
4th rowYes
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 11529
 
3.1%
Yes 11525
 
3.1%
(Missing) 349861
93.8%

Length

2023-06-15T17:25:42.348654image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:42.434866image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 11529
25.0%
or 11529
25.0%
unspecified 11529
25.0%
yes 11525
25.0%

Most occurring characters

ValueCountFrequency (%)
e 46112
18.2%
o 23058
9.1%
n 23058
9.1%
23058
9.1%
i 23058
9.1%
s 23054
9.1%
N 11529
 
4.5%
r 11529
 
4.5%
U 11529
 
4.5%
p 11529
 
4.5%
Other values (4) 46112
18.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 195985
77.3%
Uppercase Letter 34583
 
13.6%
Space Separator 23058
 
9.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 46112
23.5%
o 23058
11.8%
n 23058
11.8%
i 23058
11.8%
s 23054
11.8%
r 11529
 
5.9%
p 11529
 
5.9%
c 11529
 
5.9%
f 11529
 
5.9%
d 11529
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
N 11529
33.3%
U 11529
33.3%
Y 11525
33.3%
Space Separator
ValueCountFrequency (%)
23058
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 230568
90.9%
Common 23058
 
9.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 46112
20.0%
o 23058
10.0%
n 23058
10.0%
i 23058
10.0%
s 23054
10.0%
N 11529
 
5.0%
r 11529
 
5.0%
U 11529
 
5.0%
p 11529
 
5.0%
c 11529
 
5.0%
Other values (3) 34583
15.0%
Common
ValueCountFrequency (%)
23058
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 253626
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 46112
18.2%
o 23058
9.1%
n 23058
9.1%
23058
9.1%
i 23058
9.1%
s 23054
9.1%
N 11529
 
4.5%
r 11529
 
4.5%
U 11529
 
4.5%
p 11529
 
4.5%
Other values (4) 46112
18.2%

Tip_Control
Categorical

Distinct3
Distinct (%)< 0.1%
Missing349872
Missing (%)93.8%
Memory size2.8 MiB
None or Unspecified
14451 
Sideshift & Tip
6760 
Tip
1832 

Length

Max length19
Median length19
Mean length16.554485
Min length3

Characters and Unicode

Total characters381465
Distinct characters18
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSideshift & Tip
2nd rowSideshift & Tip
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowSideshift & Tip

Common Values

ValueCountFrequency (%)
None or Unspecified 14451
 
3.9%
Sideshift & Tip 6760
 
1.8%
Tip 1832
 
0.5%
(Missing) 349872
93.8%

Length

2023-06-15T17:25:42.508961image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:42.595248image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 14451
22.1%
or 14451
22.1%
unspecified 14451
22.1%
tip 8592
13.1%
sideshift 6760
10.3%
6760
10.3%

Most occurring characters

ValueCountFrequency (%)
i 51014
13.4%
e 50113
13.1%
42422
11.1%
n 28902
 
7.6%
o 28902
 
7.6%
p 23043
 
6.0%
d 21211
 
5.6%
f 21211
 
5.6%
s 21211
 
5.6%
N 14451
 
3.8%
Other values (8) 78985
20.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 288029
75.5%
Uppercase Letter 44254
 
11.6%
Space Separator 42422
 
11.1%
Other Punctuation 6760
 
1.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 51014
17.7%
e 50113
17.4%
n 28902
10.0%
o 28902
10.0%
p 23043
8.0%
d 21211
7.4%
f 21211
7.4%
s 21211
7.4%
c 14451
 
5.0%
r 14451
 
5.0%
Other values (2) 13520
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
N 14451
32.7%
U 14451
32.7%
T 8592
19.4%
S 6760
15.3%
Space Separator
ValueCountFrequency (%)
42422
100.0%
Other Punctuation
ValueCountFrequency (%)
& 6760
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 332283
87.1%
Common 49182
 
12.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 51014
15.4%
e 50113
15.1%
n 28902
8.7%
o 28902
8.7%
p 23043
 
6.9%
d 21211
 
6.4%
f 21211
 
6.4%
s 21211
 
6.4%
N 14451
 
4.3%
c 14451
 
4.3%
Other values (6) 57774
17.4%
Common
ValueCountFrequency (%)
42422
86.3%
& 6760
 
13.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 381465
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 51014
13.4%
e 50113
13.1%
42422
11.1%
n 28902
 
7.6%
o 28902
 
7.6%
p 23043
 
6.0%
d 21211
 
5.6%
f 21211
 
5.6%
s 21211
 
5.6%
N 14451
 
3.8%
Other values (8) 78985
20.7%

Tire_Size
Categorical

Distinct17
Distinct (%)< 0.1%
Missing284993
Missing (%)76.4%
Memory size2.8 MiB
None or Unspecified
42876 
20.5
14518 
14"
8165 
23.5
8052 
26.5
 
4091
Other values (12)
10220 

Length

Max length19
Median length7
Mean length11.249039
Min length3

Characters and Unicode

Total characters989038
Distinct characters25
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 42876
 
11.5%
20.5 14518
 
3.9%
14" 8165
 
2.2%
23.5 8052
 
2.2%
26.5 4091
 
1.1%
17.5 3551
 
1.0%
29.5 2390
 
0.6%
17.5" 1665
 
0.4%
20.5" 668
 
0.2%
13" 655
 
0.2%
Other values (7) 1291
 
0.3%
(Missing) 284993
76.4%

Length

2023-06-15T17:25:42.674038image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
none 42876
24.7%
or 42876
24.7%
unspecified 42876
24.7%
20.5 15186
 
8.7%
23.5 8335
 
4.8%
14 8165
 
4.7%
17.5 5216
 
3.0%
26.5 4091
 
2.4%
29.5 2390
 
1.4%
15.5 938
 
0.5%
Other values (5) 728
 
0.4%

Most occurring characters

ValueCountFrequency (%)
e 128628
13.0%
n 85755
 
8.7%
85755
 
8.7%
i 85755
 
8.7%
o 85752
 
8.7%
c 42879
 
4.3%
N 42876
 
4.3%
f 42876
 
4.3%
d 42876
 
4.3%
p 42876
 
4.3%
Other values (15) 303010
30.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 643152
65.0%
Decimal Number 126264
 
12.8%
Space Separator 85755
 
8.7%
Uppercase Letter 85752
 
8.7%
Other Punctuation 48115
 
4.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 128628
20.0%
n 85755
13.3%
i 85755
13.3%
o 85752
13.3%
c 42879
 
6.7%
f 42876
 
6.7%
d 42876
 
6.7%
p 42876
 
6.7%
s 42876
 
6.7%
r 42876
 
6.7%
Decimal Number
ValueCountFrequency (%)
5 37094
29.4%
2 30018
23.8%
0 15240
12.1%
1 15001
11.9%
3 9006
 
7.1%
4 8165
 
6.5%
7 5259
 
4.2%
6 4091
 
3.2%
9 2390
 
1.9%
Uppercase Letter
ValueCountFrequency (%)
N 42876
50.0%
U 42876
50.0%
Other Punctuation
ValueCountFrequency (%)
. 36215
75.3%
" 11900
 
24.7%
Space Separator
ValueCountFrequency (%)
85755
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 728904
73.7%
Common 260134
 
26.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 128628
17.6%
n 85755
11.8%
i 85755
11.8%
o 85752
11.8%
c 42879
 
5.9%
N 42876
 
5.9%
f 42876
 
5.9%
d 42876
 
5.9%
p 42876
 
5.9%
s 42876
 
5.9%
Other values (3) 85755
11.8%
Common
ValueCountFrequency (%)
85755
33.0%
5 37094
14.3%
. 36215
13.9%
2 30018
 
11.5%
0 15240
 
5.9%
1 15001
 
5.8%
" 11900
 
4.6%
3 9006
 
3.5%
4 8165
 
3.1%
7 5259
 
2.0%
Other values (2) 6481
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 989038
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 128628
13.0%
n 85755
 
8.7%
85755
 
8.7%
i 85755
 
8.7%
o 85752
 
8.7%
c 42879
 
4.3%
N 42876
 
4.3%
f 42876
 
4.3%
d 42876
 
4.3%
p 42876
 
4.3%
Other values (15) 303010
30.6%

Coupler
Categorical

IMBALANCE  MISSING 

Distinct3
Distinct (%)< 0.1%
Missing176136
Missing (%)47.2%
Memory size2.8 MiB
None or Unspecified
168684 
Manual
22297 
Hydraulic
 
5798

Length

Max length19
Median length19
Mean length17.232327
Min length6

Characters and Unicode

Total characters3390960
Distinct characters19
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 168684
45.2%
Manual 22297
 
6.0%
Hydraulic 5798
 
1.6%
(Missing) 176136
47.2%

Length

2023-06-15T17:25:42.765724image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:42.860024image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 168684
31.6%
or 168684
31.6%
unspecified 168684
31.6%
manual 22297
 
4.2%
hydraulic 5798
 
1.1%

Most occurring characters

ValueCountFrequency (%)
e 506052
14.9%
n 359665
10.6%
i 343166
10.1%
o 337368
9.9%
337368
9.9%
c 174482
 
5.1%
r 174482
 
5.1%
d 174482
 
5.1%
f 168684
 
5.0%
N 168684
 
5.0%
Other values (9) 646527
19.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2688129
79.3%
Uppercase Letter 365463
 
10.8%
Space Separator 337368
 
9.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 506052
18.8%
n 359665
13.4%
i 343166
12.8%
o 337368
12.6%
c 174482
 
6.5%
r 174482
 
6.5%
d 174482
 
6.5%
f 168684
 
6.3%
p 168684
 
6.3%
s 168684
 
6.3%
Other values (4) 112380
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
N 168684
46.2%
U 168684
46.2%
M 22297
 
6.1%
H 5798
 
1.6%
Space Separator
ValueCountFrequency (%)
337368
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3053592
90.1%
Common 337368
 
9.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 506052
16.6%
n 359665
11.8%
i 343166
11.2%
o 337368
11.0%
c 174482
 
5.7%
r 174482
 
5.7%
d 174482
 
5.7%
f 168684
 
5.5%
N 168684
 
5.5%
p 168684
 
5.5%
Other values (8) 477843
15.6%
Common
ValueCountFrequency (%)
337368
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3390960
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 506052
14.9%
n 359665
10.6%
i 343166
10.1%
o 337368
9.9%
337368
9.9%
c 174482
 
5.1%
r 174482
 
5.1%
d 174482
 
5.1%
f 168684
 
5.0%
N 168684
 
5.0%
Other values (9) 646527
19.1%

Coupler_System
Categorical

IMBALANCE  MISSING 

Distinct2
Distinct (%)< 0.1%
Missing332096
Missing (%)89.1%
Memory size2.8 MiB
None or Unspecified
37765 
Yes
 
3054

Length

Max length19
Median length19
Mean length17.80291
Min length3

Characters and Unicode

Total characters726697
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 37765
 
10.1%
Yes 3054
 
0.8%
(Missing) 332096
89.1%

Length

2023-06-15T17:25:42.944070image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:43.037297image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 37765
32.5%
or 37765
32.5%
unspecified 37765
32.5%
yes 3054
 
2.6%

Most occurring characters

ValueCountFrequency (%)
e 116349
16.0%
o 75530
10.4%
n 75530
10.4%
75530
10.4%
i 75530
10.4%
s 40819
 
5.6%
N 37765
 
5.2%
r 37765
 
5.2%
U 37765
 
5.2%
p 37765
 
5.2%
Other values (4) 116349
16.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 572583
78.8%
Uppercase Letter 78584
 
10.8%
Space Separator 75530
 
10.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 116349
20.3%
o 75530
13.2%
n 75530
13.2%
i 75530
13.2%
s 40819
 
7.1%
r 37765
 
6.6%
p 37765
 
6.6%
c 37765
 
6.6%
f 37765
 
6.6%
d 37765
 
6.6%
Uppercase Letter
ValueCountFrequency (%)
N 37765
48.1%
U 37765
48.1%
Y 3054
 
3.9%
Space Separator
ValueCountFrequency (%)
75530
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 651167
89.6%
Common 75530
 
10.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 116349
17.9%
o 75530
11.6%
n 75530
11.6%
i 75530
11.6%
s 40819
 
6.3%
N 37765
 
5.8%
r 37765
 
5.8%
U 37765
 
5.8%
p 37765
 
5.8%
c 37765
 
5.8%
Other values (3) 78584
12.1%
Common
ValueCountFrequency (%)
75530
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 726697
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 116349
16.0%
o 75530
10.4%
n 75530
10.4%
75530
10.4%
i 75530
10.4%
s 40819
 
5.6%
N 37765
 
5.2%
r 37765
 
5.2%
U 37765
 
5.2%
p 37765
 
5.2%
Other values (4) 116349
16.0%

Grouser_Tracks
Categorical

IMBALANCE  MISSING 

Distinct2
Distinct (%)< 0.1%
Missing332180
Missing (%)89.1%
Memory size2.8 MiB
None or Unspecified
37882 
Yes
 
2853

Length

Max length19
Median length19
Mean length17.879391
Min length3

Characters and Unicode

Total characters728317
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 37882
 
10.2%
Yes 2853
 
0.8%
(Missing) 332180
89.1%

Length

2023-06-15T17:25:43.116954image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:43.210546image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 37882
32.5%
or 37882
32.5%
unspecified 37882
32.5%
yes 2853
 
2.4%

Most occurring characters

ValueCountFrequency (%)
e 116499
16.0%
o 75764
10.4%
n 75764
10.4%
75764
10.4%
i 75764
10.4%
s 40735
 
5.6%
N 37882
 
5.2%
r 37882
 
5.2%
U 37882
 
5.2%
p 37882
 
5.2%
Other values (4) 116499
16.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 573936
78.8%
Uppercase Letter 78617
 
10.8%
Space Separator 75764
 
10.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 116499
20.3%
o 75764
13.2%
n 75764
13.2%
i 75764
13.2%
s 40735
 
7.1%
r 37882
 
6.6%
p 37882
 
6.6%
c 37882
 
6.6%
f 37882
 
6.6%
d 37882
 
6.6%
Uppercase Letter
ValueCountFrequency (%)
N 37882
48.2%
U 37882
48.2%
Y 2853
 
3.6%
Space Separator
ValueCountFrequency (%)
75764
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 652553
89.6%
Common 75764
 
10.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 116499
17.9%
o 75764
11.6%
n 75764
11.6%
i 75764
11.6%
s 40735
 
6.2%
N 37882
 
5.8%
r 37882
 
5.8%
U 37882
 
5.8%
p 37882
 
5.8%
c 37882
 
5.8%
Other values (3) 78617
12.0%
Common
ValueCountFrequency (%)
75764
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 728317
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 116499
16.0%
o 75764
10.4%
n 75764
10.4%
75764
10.4%
i 75764
10.4%
s 40735
 
5.6%
N 37882
 
5.2%
r 37882
 
5.2%
U 37882
 
5.2%
p 37882
 
5.2%
Other values (4) 116499
16.0%

Hydraulics_Flow
Categorical

IMBALANCE  MISSING 

Distinct3
Distinct (%)< 0.1%
Missing332180
Missing (%)89.1%
Memory size2.8 MiB
Standard
40147 
High Flow
 
562
None or Unspecified
 
26

Length

Max length19
Median length8
Mean length8.0208175
Min length8

Characters and Unicode

Total characters326728
Distinct characters22
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowStandard
2nd rowStandard
3rd rowStandard
4th rowStandard
5th rowStandard

Common Values

ValueCountFrequency (%)
Standard 40147
 
10.8%
High Flow 562
 
0.2%
None or Unspecified 26
 
< 0.1%
(Missing) 332180
89.1%

Length

2023-06-15T17:25:43.285954image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:43.376070image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
standard 40147
97.1%
high 562
 
1.4%
flow 562
 
1.4%
none 26
 
0.1%
or 26
 
0.1%
unspecified 26
 
0.1%

Most occurring characters

ValueCountFrequency (%)
d 80320
24.6%
a 80294
24.6%
n 40199
12.3%
r 40173
12.3%
S 40147
12.3%
t 40147
12.3%
i 614
 
0.2%
o 614
 
0.2%
614
 
0.2%
w 562
 
0.2%
Other values (12) 3044
 
0.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 284791
87.2%
Uppercase Letter 41323
 
12.6%
Space Separator 614
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
d 80320
28.2%
a 80294
28.2%
n 40199
14.1%
r 40173
14.1%
t 40147
14.1%
i 614
 
0.2%
o 614
 
0.2%
w 562
 
0.2%
l 562
 
0.2%
h 562
 
0.2%
Other values (6) 744
 
0.3%
Uppercase Letter
ValueCountFrequency (%)
S 40147
97.2%
F 562
 
1.4%
H 562
 
1.4%
N 26
 
0.1%
U 26
 
0.1%
Space Separator
ValueCountFrequency (%)
614
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 326114
99.8%
Common 614
 
0.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
d 80320
24.6%
a 80294
24.6%
n 40199
12.3%
r 40173
12.3%
S 40147
12.3%
t 40147
12.3%
i 614
 
0.2%
o 614
 
0.2%
w 562
 
0.2%
l 562
 
0.2%
Other values (11) 2482
 
0.8%
Common
ValueCountFrequency (%)
614
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 326728
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
d 80320
24.6%
a 80294
24.6%
n 40199
12.3%
r 40173
12.3%
S 40147
12.3%
t 40147
12.3%
i 614
 
0.2%
o 614
 
0.2%
614
 
0.2%
w 562
 
0.2%
Other values (12) 3044
 
0.9%

Track_Type
Categorical

Distinct2
Distinct (%)< 0.1%
Missing283243
Missing (%)76.0%
Memory size2.8 MiB
Steel
77300 
Rubber
12372 

Length

Max length6
Median length5
Mean length5.1379695
Min length5

Characters and Unicode

Total characters460732
Distinct characters8
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSteel
2nd rowSteel
3rd rowSteel
4th rowSteel
5th rowSteel

Common Values

ValueCountFrequency (%)
Steel 77300
 
20.7%
Rubber 12372
 
3.3%
(Missing) 283243
76.0%

Length

2023-06-15T17:25:43.456139image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:43.544722image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
steel 77300
86.2%
rubber 12372
 
13.8%

Most occurring characters

ValueCountFrequency (%)
e 166972
36.2%
S 77300
16.8%
t 77300
16.8%
l 77300
16.8%
b 24744
 
5.4%
R 12372
 
2.7%
u 12372
 
2.7%
r 12372
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 371060
80.5%
Uppercase Letter 89672
 
19.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 166972
45.0%
t 77300
20.8%
l 77300
20.8%
b 24744
 
6.7%
u 12372
 
3.3%
r 12372
 
3.3%
Uppercase Letter
ValueCountFrequency (%)
S 77300
86.2%
R 12372
 
13.8%

Most occurring scripts

ValueCountFrequency (%)
Latin 460732
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 166972
36.2%
S 77300
16.8%
t 77300
16.8%
l 77300
16.8%
b 24744
 
5.4%
R 12372
 
2.7%
u 12372
 
2.7%
r 12372
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 460732
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 166972
36.2%
S 77300
16.8%
t 77300
16.8%
l 77300
16.8%
b 24744
 
5.4%
R 12372
 
2.7%
u 12372
 
2.7%
r 12372
 
2.7%

Undercarriage_Pad_Width
Categorical

IMBALANCE  MISSING 

Distinct19
Distinct (%)< 0.1%
Missing282723
Missing (%)75.8%
Memory size2.8 MiB
None or Unspecified
72128 
32 inch
 
4731
28 inch
 
2845
24 inch
 
2612
20 inch
 
2266
Other values (14)
 
5610

Length

Max length19
Median length19
Mean length16.596638
Min length7

Characters and Unicode

Total characters1496884
Distinct characters24
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row24 inch
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th row30 inch
5th row30 inch

Common Values

ValueCountFrequency (%)
None or Unspecified 72128
 
19.3%
32 inch 4731
 
1.3%
28 inch 2845
 
0.8%
24 inch 2612
 
0.7%
20 inch 2266
 
0.6%
30 inch 1455
 
0.4%
36 inch 1373
 
0.4%
18 inch 1225
 
0.3%
34 inch 497
 
0.1%
16 inch 402
 
0.1%
Other values (9) 658
 
0.2%
(Missing) 282723
75.8%

Length

2023-06-15T17:25:43.623761image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
none 72128
28.6%
or 72128
28.6%
unspecified 72128
28.6%
inch 18064
 
7.2%
32 4731
 
1.9%
28 2845
 
1.1%
24 2612
 
1.0%
20 2266
 
0.9%
30 1455
 
0.6%
36 1373
 
0.5%
Other values (12) 2782
 
1.1%

Most occurring characters

ValueCountFrequency (%)
e 216384
14.5%
n 162320
10.8%
162320
10.8%
i 162320
10.8%
o 144256
9.6%
c 90192
 
6.0%
N 72128
 
4.8%
f 72128
 
4.8%
d 72128
 
4.8%
p 72128
 
4.8%
Other values (14) 270580
18.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1154176
77.1%
Space Separator 162320
 
10.8%
Uppercase Letter 144256
 
9.6%
Decimal Number 36130
 
2.4%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 216384
18.7%
n 162320
14.1%
i 162320
14.1%
o 144256
12.5%
c 90192
7.8%
f 72128
 
6.2%
d 72128
 
6.2%
p 72128
 
6.2%
s 72128
 
6.2%
r 72128
 
6.2%
Decimal Number
ValueCountFrequency (%)
2 12909
35.7%
3 8389
23.2%
8 4070
 
11.3%
0 3721
 
10.3%
4 3156
 
8.7%
1 1869
 
5.2%
6 1859
 
5.1%
7 119
 
0.3%
5 38
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
N 72128
50.0%
U 72128
50.0%
Space Separator
ValueCountFrequency (%)
162320
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1298432
86.7%
Common 198452
 
13.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 216384
16.7%
n 162320
12.5%
i 162320
12.5%
o 144256
11.1%
c 90192
6.9%
N 72128
 
5.6%
f 72128
 
5.6%
d 72128
 
5.6%
p 72128
 
5.6%
s 72128
 
5.6%
Other values (3) 162320
12.5%
Common
ValueCountFrequency (%)
162320
81.8%
2 12909
 
6.5%
3 8389
 
4.2%
8 4070
 
2.1%
0 3721
 
1.9%
4 3156
 
1.6%
1 1869
 
0.9%
6 1859
 
0.9%
7 119
 
0.1%
5 38
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1496884
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 216384
14.5%
n 162320
10.8%
162320
10.8%
i 162320
10.8%
o 144256
9.6%
c 90192
 
6.0%
N 72128
 
4.8%
f 72128
 
4.8%
d 72128
 
4.8%
p 72128
 
4.8%
Other values (14) 270580
18.1%

Stick_Length
Categorical

IMBALANCE  MISSING 

Distinct28
Distinct (%)< 0.1%
Missing283187
Missing (%)75.9%
Memory size2.8 MiB
None or Unspecified
70682 
9' 6"
 
5378
10' 6"
 
3250
11' 0"
 
1383
9' 10"
 
1380
Other values (23)
7655 

Length

Max length19
Median length19
Mean length16.151513
Min length5

Characters and Unicode

Total characters1449243
Distinct characters25
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 70682
 
19.0%
9' 6" 5378
 
1.4%
10' 6" 3250
 
0.9%
11' 0" 1383
 
0.4%
9' 10" 1380
 
0.4%
9' 8" 1367
 
0.4%
9' 7" 1319
 
0.4%
12' 10" 1050
 
0.3%
10' 2" 937
 
0.3%
8' 6" 792
 
0.2%
Other values (18) 2190
 
0.6%
(Missing) 283187
75.9%

Length

2023-06-15T17:25:43.716149image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
none 70682
28.3%
or 70682
28.3%
unspecified 70682
28.3%
9 9617
 
3.8%
6 9464
 
3.8%
10 7813
 
3.1%
8 3312
 
1.3%
11 1674
 
0.7%
2 1473
 
0.6%
12 1449
 
0.6%
Other values (10) 3290
 
1.3%

Most occurring characters

ValueCountFrequency (%)
e 212046
14.6%
160410
11.1%
n 141364
9.8%
o 141364
9.8%
i 141364
9.8%
N 70682
 
4.9%
c 70682
 
4.9%
f 70682
 
4.9%
d 70682
 
4.9%
p 70682
 
4.9%
Other values (15) 299285
20.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1060230
73.2%
Space Separator 160410
 
11.1%
Uppercase Letter 141364
 
9.8%
Decimal Number 49147
 
3.4%
Other Punctuation 38092
 
2.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 212046
20.0%
n 141364
13.3%
o 141364
13.3%
i 141364
13.3%
c 70682
 
6.7%
f 70682
 
6.7%
d 70682
 
6.7%
p 70682
 
6.7%
s 70682
 
6.7%
r 70682
 
6.7%
Decimal Number
ValueCountFrequency (%)
1 12736
25.9%
9 9622
19.6%
6 9464
19.3%
0 9196
18.7%
8 3312
 
6.7%
2 2922
 
5.9%
7 1333
 
2.7%
4 325
 
0.7%
5 168
 
0.3%
3 69
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
N 70682
50.0%
U 70682
50.0%
Other Punctuation
ValueCountFrequency (%)
' 19046
50.0%
" 19046
50.0%
Space Separator
ValueCountFrequency (%)
160410
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1201594
82.9%
Common 247649
 
17.1%

Most frequent character per script

Common
ValueCountFrequency (%)
160410
64.8%
' 19046
 
7.7%
" 19046
 
7.7%
1 12736
 
5.1%
9 9622
 
3.9%
6 9464
 
3.8%
0 9196
 
3.7%
8 3312
 
1.3%
2 2922
 
1.2%
7 1333
 
0.5%
Other values (3) 562
 
0.2%
Latin
ValueCountFrequency (%)
e 212046
17.6%
n 141364
11.8%
o 141364
11.8%
i 141364
11.8%
N 70682
 
5.9%
c 70682
 
5.9%
f 70682
 
5.9%
d 70682
 
5.9%
p 70682
 
5.9%
s 70682
 
5.9%
Other values (2) 141364
11.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1449243
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 212046
14.6%
160410
11.1%
n 141364
9.8%
o 141364
9.8%
i 141364
9.8%
N 70682
 
4.9%
c 70682
 
4.9%
f 70682
 
4.9%
d 70682
 
4.9%
p 70682
 
4.9%
Other values (15) 299285
20.7%

Thumb
Categorical

Distinct3
Distinct (%)< 0.1%
Missing283129
Missing (%)75.9%
Memory size2.8 MiB
None or Unspecified
74112 
Manual
8654 
Hydraulic
 
7020

Length

Max length19
Median length19
Mean length16.965139
Min length6

Characters and Unicode

Total characters1523232
Distinct characters19
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 74112
 
19.9%
Manual 8654
 
2.3%
Hydraulic 7020
 
1.9%
(Missing) 283129
75.9%

Length

2023-06-15T17:25:43.810877image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:43.933832image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 74112
31.1%
or 74112
31.1%
unspecified 74112
31.1%
manual 8654
 
3.6%
hydraulic 7020
 
2.9%

Most occurring characters

ValueCountFrequency (%)
e 222336
14.6%
n 156878
10.3%
i 155244
10.2%
o 148224
9.7%
148224
9.7%
c 81132
 
5.3%
r 81132
 
5.3%
d 81132
 
5.3%
f 74112
 
4.9%
N 74112
 
4.9%
Other values (9) 300706
19.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1211110
79.5%
Uppercase Letter 163898
 
10.8%
Space Separator 148224
 
9.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 222336
18.4%
n 156878
13.0%
i 155244
12.8%
o 148224
12.2%
c 81132
 
6.7%
r 81132
 
6.7%
d 81132
 
6.7%
f 74112
 
6.1%
p 74112
 
6.1%
s 74112
 
6.1%
Other values (4) 62696
 
5.2%
Uppercase Letter
ValueCountFrequency (%)
N 74112
45.2%
U 74112
45.2%
M 8654
 
5.3%
H 7020
 
4.3%
Space Separator
ValueCountFrequency (%)
148224
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1375008
90.3%
Common 148224
 
9.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 222336
16.2%
n 156878
11.4%
i 155244
11.3%
o 148224
10.8%
c 81132
 
5.9%
r 81132
 
5.9%
d 81132
 
5.9%
f 74112
 
5.4%
N 74112
 
5.4%
p 74112
 
5.4%
Other values (8) 226594
16.5%
Common
ValueCountFrequency (%)
148224
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1523232
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 222336
14.6%
n 156878
10.3%
i 155244
10.2%
o 148224
9.7%
148224
9.7%
c 81132
 
5.3%
r 81132
 
5.3%
d 81132
 
5.3%
f 74112
 
4.9%
N 74112
 
4.9%
Other values (9) 300706
19.7%

Pattern_Changer
Categorical

IMBALANCE  MISSING 

Distinct3
Distinct (%)< 0.1%
Missing283187
Missing (%)75.9%
Memory size2.8 MiB
None or Unspecified
80615 
Yes
9057 
No
 
56

Length

Max length19
Median length19
Mean length17.374376
Min length2

Characters and Unicode

Total characters1558968
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 80615
 
21.6%
Yes 9057
 
2.4%
No 56
 
< 0.1%
(Missing) 283187
75.9%

Length

2023-06-15T17:25:44.044468image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:44.138875image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 80615
32.1%
or 80615
32.1%
unspecified 80615
32.1%
yes 9057
 
3.6%
no 56
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
e 250902
16.1%
o 161286
10.3%
n 161230
10.3%
161230
10.3%
i 161230
10.3%
s 89672
 
5.8%
N 80671
 
5.2%
r 80615
 
5.2%
U 80615
 
5.2%
p 80615
 
5.2%
Other values (4) 250902
16.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1227395
78.7%
Uppercase Letter 170343
 
10.9%
Space Separator 161230
 
10.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 250902
20.4%
o 161286
13.1%
n 161230
13.1%
i 161230
13.1%
s 89672
 
7.3%
r 80615
 
6.6%
p 80615
 
6.6%
c 80615
 
6.6%
f 80615
 
6.6%
d 80615
 
6.6%
Uppercase Letter
ValueCountFrequency (%)
N 80671
47.4%
U 80615
47.3%
Y 9057
 
5.3%
Space Separator
ValueCountFrequency (%)
161230
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1397738
89.7%
Common 161230
 
10.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 250902
18.0%
o 161286
11.5%
n 161230
11.5%
i 161230
11.5%
s 89672
 
6.4%
N 80671
 
5.8%
r 80615
 
5.8%
U 80615
 
5.8%
p 80615
 
5.8%
c 80615
 
5.8%
Other values (3) 170287
12.2%
Common
ValueCountFrequency (%)
161230
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1558968
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 250902
16.1%
o 161286
10.3%
n 161230
10.3%
161230
10.3%
i 161230
10.3%
s 89672
 
5.8%
N 80671
 
5.2%
r 80615
 
5.2%
U 80615
 
5.2%
p 80615
 
5.2%
Other values (4) 250902
16.1%

Grouser_Type
Categorical

IMBALANCE  MISSING 

Distinct3
Distinct (%)< 0.1%
Missing283243
Missing (%)76.0%
Memory size2.8 MiB
Double
75866 
Triple
13804 
Single
 
2

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters538032
Distinct characters13
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowDouble
2nd rowTriple
3rd rowDouble
4th rowDouble
5th rowDouble

Common Values

ValueCountFrequency (%)
Double 75866
 
20.3%
Triple 13804
 
3.7%
Single 2
 
< 0.1%
(Missing) 283243
76.0%

Length

2023-06-15T17:25:44.218587image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:44.306692image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
double 75866
84.6%
triple 13804
 
15.4%
single 2
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
l 89672
16.7%
e 89672
16.7%
D 75866
14.1%
o 75866
14.1%
u 75866
14.1%
b 75866
14.1%
i 13806
 
2.6%
T 13804
 
2.6%
r 13804
 
2.6%
p 13804
 
2.6%
Other values (3) 6
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 448360
83.3%
Uppercase Letter 89672
 
16.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
l 89672
20.0%
e 89672
20.0%
o 75866
16.9%
u 75866
16.9%
b 75866
16.9%
i 13806
 
3.1%
r 13804
 
3.1%
p 13804
 
3.1%
n 2
 
< 0.1%
g 2
 
< 0.1%
Uppercase Letter
ValueCountFrequency (%)
D 75866
84.6%
T 13804
 
15.4%
S 2
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Latin 538032
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
l 89672
16.7%
e 89672
16.7%
D 75866
14.1%
o 75866
14.1%
u 75866
14.1%
b 75866
14.1%
i 13806
 
2.6%
T 13804
 
2.6%
r 13804
 
2.6%
p 13804
 
2.6%
Other values (3) 6
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 538032
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
l 89672
16.7%
e 89672
16.7%
D 75866
14.1%
o 75866
14.1%
u 75866
14.1%
b 75866
14.1%
i 13806
 
2.6%
T 13804
 
2.6%
r 13804
 
2.6%
p 13804
 
2.6%
Other values (3) 6
 
< 0.1%

Backhoe_Mounting
Categorical

IMBALANCE  MISSING 

Distinct2
Distinct (%)< 0.1%
Missing299233
Missing (%)80.2%
Memory size2.8 MiB
None or Unspecified
73662 
Yes
 
20

Length

Max length19
Median length19
Mean length18.995657
Min length3

Characters and Unicode

Total characters1399638
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowNone or Unspecified
4th rowNone or Unspecified
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 73662
 
19.8%
Yes 20
 
< 0.1%
(Missing) 299233
80.2%

Length

2023-06-15T17:25:44.392940image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:44.487751image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 73662
33.3%
or 73662
33.3%
unspecified 73662
33.3%
yes 20
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
e 221006
15.8%
o 147324
10.5%
n 147324
10.5%
147324
10.5%
i 147324
10.5%
s 73682
 
5.3%
N 73662
 
5.3%
r 73662
 
5.3%
U 73662
 
5.3%
p 73662
 
5.3%
Other values (4) 221006
15.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1104970
78.9%
Uppercase Letter 147344
 
10.5%
Space Separator 147324
 
10.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 221006
20.0%
o 147324
13.3%
n 147324
13.3%
i 147324
13.3%
s 73682
 
6.7%
r 73662
 
6.7%
p 73662
 
6.7%
c 73662
 
6.7%
f 73662
 
6.7%
d 73662
 
6.7%
Uppercase Letter
ValueCountFrequency (%)
N 73662
50.0%
U 73662
50.0%
Y 20
 
< 0.1%
Space Separator
ValueCountFrequency (%)
147324
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1252314
89.5%
Common 147324
 
10.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 221006
17.6%
o 147324
11.8%
n 147324
11.8%
i 147324
11.8%
s 73682
 
5.9%
N 73662
 
5.9%
r 73662
 
5.9%
U 73662
 
5.9%
p 73662
 
5.9%
c 73662
 
5.9%
Other values (3) 147344
11.8%
Common
ValueCountFrequency (%)
147324
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1399638
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 221006
15.8%
o 147324
10.5%
n 147324
10.5%
147324
10.5%
i 147324
10.5%
s 73682
 
5.3%
N 73662
 
5.3%
r 73662
 
5.3%
U 73662
 
5.3%
p 73662
 
5.3%
Other values (4) 221006
15.8%

Blade_Type
Categorical

Distinct10
Distinct (%)< 0.1%
Missing298372
Missing (%)80.0%
Memory size2.8 MiB
PAT
35909 
Straight
12500 
None or Unspecified
10263 
Semi U
8306 
VPAT
3620 
Other values (5)
3945 

Length

Max length19
Median length8
Mean length6.4119367
Min length1

Characters and Unicode

Total characters477965
Distinct characters26
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowStraight
2nd rowStraight
3rd rowPAT
4th rowStraight
5th rowPAT

Common Values

ValueCountFrequency (%)
PAT 35909
 
9.6%
Straight 12500
 
3.4%
None or Unspecified 10263
 
2.8%
Semi U 8306
 
2.2%
VPAT 3620
 
1.0%
U 1803
 
0.5%
Angle 1560
 
0.4%
No 549
 
0.1%
Landfill 23
 
< 0.1%
Coal 10
 
< 0.1%
(Missing) 298372
80.0%

Length

2023-06-15T17:25:44.566736image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:44.679999image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
pat 35909
34.7%
straight 12500
 
12.1%
none 10263
 
9.9%
or 10263
 
9.9%
unspecified 10263
 
9.9%
u 10109
 
9.8%
semi 8306
 
8.0%
vpat 3620
 
3.5%
angle 1560
 
1.5%
no 549
 
0.5%
Other values (2) 33
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
i 41355
 
8.7%
A 41089
 
8.6%
e 40655
 
8.5%
P 39529
 
8.3%
T 39529
 
8.3%
28832
 
6.0%
t 25000
 
5.2%
r 22763
 
4.8%
n 22109
 
4.6%
o 21085
 
4.4%
Other values (16) 156019
32.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 273343
57.2%
Uppercase Letter 175790
36.8%
Space Separator 28832
 
6.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 41355
15.1%
e 40655
14.9%
t 25000
9.1%
r 22763
8.3%
n 22109
8.1%
o 21085
7.7%
g 14060
 
5.1%
a 12533
 
4.6%
h 12500
 
4.6%
f 10286
 
3.8%
Other values (6) 50997
18.7%
Uppercase Letter
ValueCountFrequency (%)
A 41089
23.4%
P 39529
22.5%
T 39529
22.5%
S 20806
11.8%
U 20372
11.6%
N 10812
 
6.2%
V 3620
 
2.1%
L 23
 
< 0.1%
C 10
 
< 0.1%
Space Separator
ValueCountFrequency (%)
28832
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 449133
94.0%
Common 28832
 
6.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 41355
 
9.2%
A 41089
 
9.1%
e 40655
 
9.1%
P 39529
 
8.8%
T 39529
 
8.8%
t 25000
 
5.6%
r 22763
 
5.1%
n 22109
 
4.9%
o 21085
 
4.7%
S 20806
 
4.6%
Other values (15) 135213
30.1%
Common
ValueCountFrequency (%)
28832
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 477965
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 41355
 
8.7%
A 41089
 
8.6%
e 40655
 
8.5%
P 39529
 
8.3%
T 39529
 
8.3%
28832
 
6.0%
t 25000
 
5.2%
r 22763
 
4.8%
n 22109
 
4.6%
o 21085
 
4.4%
Other values (16) 156019
32.6%

Travel_Controls
Categorical

IMBALANCE  MISSING 

Distinct7
Distinct (%)< 0.1%
Missing298367
Missing (%)80.0%
Memory size2.8 MiB
None or Unspecified
65126 
Differential Steer
 
4998
Finger Tip
 
2641
2 Pedal
 
847
Lever
 
524
Other values (2)
 
412

Length

Max length19
Median length19
Mean length18.302235
Min length5

Characters and Unicode

Total characters1364395
Distinct characters26
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNone or Unspecified
2nd rowNone or Unspecified
3rd rowLever
4th rowNone or Unspecified
5th rowNone or Unspecified

Common Values

ValueCountFrequency (%)
None or Unspecified 65126
 
17.5%
Differential Steer 4998
 
1.3%
Finger Tip 2641
 
0.7%
2 Pedal 847
 
0.2%
Lever 524
 
0.1%
Pedal 403
 
0.1%
1 Speed 9
 
< 0.1%
(Missing) 298367
80.0%

Length

2023-06-15T17:25:44.800286image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:44.906152image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
none 65126
30.5%
or 65126
30.5%
unspecified 65126
30.5%
differential 4998
 
2.3%
steer 4998
 
2.3%
finger 2641
 
1.2%
tip 2641
 
1.2%
pedal 1250
 
0.6%
2 847
 
0.4%
lever 524
 
0.2%
Other values (2) 18
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
e 220327
16.1%
i 145530
10.7%
138747
10.2%
n 137891
10.1%
o 130252
9.5%
r 78287
 
5.7%
f 75122
 
5.5%
p 67776
 
5.0%
d 66385
 
4.9%
N 65126
 
4.8%
Other values (16) 238952
17.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1077479
79.0%
Uppercase Letter 147313
 
10.8%
Space Separator 138747
 
10.2%
Decimal Number 856
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 220327
20.4%
i 145530
13.5%
n 137891
12.8%
o 130252
12.1%
r 78287
 
7.3%
f 75122
 
7.0%
p 67776
 
6.3%
d 66385
 
6.2%
c 65126
 
6.0%
s 65126
 
6.0%
Other values (5) 25657
 
2.4%
Uppercase Letter
ValueCountFrequency (%)
N 65126
44.2%
U 65126
44.2%
S 5007
 
3.4%
D 4998
 
3.4%
F 2641
 
1.8%
T 2641
 
1.8%
P 1250
 
0.8%
L 524
 
0.4%
Decimal Number
ValueCountFrequency (%)
2 847
98.9%
1 9
 
1.1%
Space Separator
ValueCountFrequency (%)
138747
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1224792
89.8%
Common 139603
 
10.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 220327
18.0%
i 145530
11.9%
n 137891
11.3%
o 130252
10.6%
r 78287
 
6.4%
f 75122
 
6.1%
p 67776
 
5.5%
d 66385
 
5.4%
N 65126
 
5.3%
c 65126
 
5.3%
Other values (13) 172970
14.1%
Common
ValueCountFrequency (%)
138747
99.4%
2 847
 
0.6%
1 9
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1364395
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 220327
16.1%
i 145530
10.7%
138747
10.2%
n 137891
10.1%
o 130252
9.5%
r 78287
 
5.7%
f 75122
 
5.5%
p 67776
 
5.0%
d 66385
 
4.9%
N 65126
 
4.8%
Other values (16) 238952
17.5%

Differential_Type
Categorical

IMBALANCE  MISSING 

Distinct4
Distinct (%)< 0.1%
Missing308097
Missing (%)82.6%
Memory size2.8 MiB
Standard
63467 
Limited Slip
 
1142
No Spin
 
207
Locking
 
2

Length

Max length12
Median length8
Mean length8.0672498
Min length7

Characters and Unicode

Total characters522903
Distinct characters18
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowStandard
2nd rowStandard
3rd rowStandard
4th rowStandard
5th rowStandard

Common Values

ValueCountFrequency (%)
Standard 63467
 
17.0%
Limited Slip 1142
 
0.3%
No Spin 207
 
0.1%
Locking 2
 
< 0.1%
(Missing) 308097
82.6%

Length

2023-06-15T17:25:45.001893image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:45.092944image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
standard 63467
95.9%
limited 1142
 
1.7%
slip 1142
 
1.7%
no 207
 
0.3%
spin 207
 
0.3%
locking 2
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
d 128076
24.5%
a 126934
24.3%
S 64816
12.4%
t 64609
12.4%
n 63676
12.2%
r 63467
12.1%
i 3635
 
0.7%
p 1349
 
0.3%
1349
 
0.3%
L 1144
 
0.2%
Other values (8) 3848
 
0.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 455387
87.1%
Uppercase Letter 66167
 
12.7%
Space Separator 1349
 
0.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
d 128076
28.1%
a 126934
27.9%
t 64609
14.2%
n 63676
14.0%
r 63467
13.9%
i 3635
 
0.8%
p 1349
 
0.3%
l 1142
 
0.3%
e 1142
 
0.3%
m 1142
 
0.3%
Other values (4) 215
 
< 0.1%
Uppercase Letter
ValueCountFrequency (%)
S 64816
98.0%
L 1144
 
1.7%
N 207
 
0.3%
Space Separator
ValueCountFrequency (%)
1349
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 521554
99.7%
Common 1349
 
0.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
d 128076
24.6%
a 126934
24.3%
S 64816
12.4%
t 64609
12.4%
n 63676
12.2%
r 63467
12.2%
i 3635
 
0.7%
p 1349
 
0.3%
L 1144
 
0.2%
l 1142
 
0.2%
Other values (7) 2706
 
0.5%
Common
ValueCountFrequency (%)
1349
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 522903
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
d 128076
24.5%
a 126934
24.3%
S 64816
12.4%
t 64609
12.4%
n 63676
12.2%
r 63467
12.1%
i 3635
 
0.7%
p 1349
 
0.3%
1349
 
0.3%
L 1144
 
0.2%
Other values (8) 3848
 
0.7%

Steering_Controls
Categorical

IMBALANCE  MISSING 

Distinct5
Distinct (%)< 0.1%
Missing308133
Missing (%)82.6%
Memory size2.8 MiB
Conventional
64081 
Command Control
 
572
Four Wheel Standard
 
118
Wheel
 
10
No
 
1

Length

Max length19
Median length12
Mean length12.038004
Min length2

Characters and Unicode

Total characters779846
Distinct characters19
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowConventional
2nd rowConventional
3rd rowConventional
4th rowConventional
5th rowConventional

Common Values

ValueCountFrequency (%)
Conventional 64081
 
17.2%
Command Control 572
 
0.2%
Four Wheel Standard 118
 
< 0.1%
Wheel 10
 
< 0.1%
No 1
 
< 0.1%
(Missing) 308133
82.6%

Length

2023-06-15T17:25:45.179130image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-06-15T17:25:45.277079image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
ValueCountFrequency (%)
conventional 64081
97.7%
command 572
 
0.9%
control 572
 
0.9%
wheel 128
 
0.2%
four 118
 
0.2%
standard 118
 
0.2%
no 1
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
n 193505
24.8%
o 129997
16.7%
C 65225
 
8.4%
a 64889
 
8.3%
l 64781
 
8.3%
t 64771
 
8.3%
e 64337
 
8.2%
v 64081
 
8.2%
i 64081
 
8.2%
m 1144
 
0.1%
Other values (9) 3035
 
0.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 713448
91.5%
Uppercase Letter 65590
 
8.4%
Space Separator 808
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 193505
27.1%
o 129997
18.2%
a 64889
 
9.1%
l 64781
 
9.1%
t 64771
 
9.1%
e 64337
 
9.0%
v 64081
 
9.0%
i 64081
 
9.0%
m 1144
 
0.2%
d 808
 
0.1%
Other values (3) 1054
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
C 65225
99.4%
W 128
 
0.2%
F 118
 
0.2%
S 118
 
0.2%
N 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
808
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 779038
99.9%
Common 808
 
0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 193505
24.8%
o 129997
16.7%
C 65225
 
8.4%
a 64889
 
8.3%
l 64781
 
8.3%
t 64771
 
8.3%
e 64337
 
8.3%
v 64081
 
8.2%
i 64081
 
8.2%
m 1144
 
0.1%
Other values (8) 2227
 
0.3%
Common
ValueCountFrequency (%)
808
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 779846
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 193505
24.8%
o 129997
16.7%
C 65225
 
8.4%
a 64889
 
8.3%
l 64781
 
8.3%
t 64771
 
8.3%
e 64337
 
8.2%
v 64081
 
8.2%
i 64081
 
8.2%
m 1144
 
0.1%
Other values (9) 3035
 
0.4%

saleYear
Real number (ℝ)

Distinct24
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2004.1991
Minimum1989
Maximum2012
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 MiB
2023-06-15T17:25:45.361474image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum1989
5-th percentile1992
Q12000
median2006
Q32009
95-th percentile2011
Maximum2012
Range23
Interquartile range (IQR)9

Descriptive statistics

Standard deviation5.9089345
Coefficient of variation (CV)0.0029482772
Kurtosis-0.32923359
Mean2004.1991
Median Absolute Deviation (MAD)4
Skewness-0.7868436
Sum7.473959 × 108
Variance34.915507
MonotonicityIncreasing
2023-06-15T17:25:45.450701image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
2009 38247
 
10.3%
2008 36332
 
9.7%
2011 32516
 
8.7%
2007 28991
 
7.8%
2010 28329
 
7.6%
2006 19696
 
5.3%
2005 18100
 
4.9%
2004 18046
 
4.8%
2001 16124
 
4.3%
2002 16062
 
4.3%
Other values (14) 120472
32.3%
ValueCountFrequency (%)
1989 4805
1.3%
1990 4529
 
1.2%
1991 5108
1.4%
1992 5519
1.5%
1993 6303
1.7%
1994 7343
2.0%
1995 7686
2.1%
1996 8179
2.2%
1997 8886
2.4%
1998 11663
3.1%
ValueCountFrequency (%)
2012 10367
 
2.8%
2011 32516
8.7%
2010 28329
7.6%
2009 38247
10.3%
2008 36332
9.7%
2007 28991
7.8%
2006 19696
5.3%
2005 18100
4.9%
2004 18046
4.8%
2003 14016
 
3.8%

saleMonth
Real number (ℝ)

Distinct12
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.3219018
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 MiB
2023-06-15T17:25:45.534984image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q13
median6
Q39
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.4441863
Coefficient of variation (CV)0.54480225
Kurtosis-1.2980288
Mean6.3219018
Median Absolute Deviation (MAD)3
Skewness0.21615651
Sum2357532
Variance11.862419
MonotonicityNot monotonic
2023-06-15T17:25:45.610540image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
3 53688
14.4%
2 50824
13.6%
6 42288
11.3%
9 39320
10.5%
12 36311
9.7%
5 28259
7.6%
4 27319
7.3%
10 27075
7.3%
11 23621
6.3%
8 20402
 
5.5%
Other values (2) 23808
6.4%
ValueCountFrequency (%)
1 9924
 
2.7%
2 50824
13.6%
3 53688
14.4%
4 27319
7.3%
5 28259
7.6%
6 42288
11.3%
7 13884
 
3.7%
8 20402
 
5.5%
9 39320
10.5%
10 27075
7.3%
ValueCountFrequency (%)
12 36311
9.7%
11 23621
6.3%
10 27075
7.3%
9 39320
10.5%
8 20402
 
5.5%
7 13884
 
3.7%
6 42288
11.3%
5 28259
7.6%
4 27319
7.3%
3 53688
14.4%

saleDate
Real number (ℝ)

Distinct31
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.121481
Minimum1
Maximum31
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 MiB
2023-06-15T17:25:45.701181image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q19
median16
Q323
95-th percentile29
Maximum31
Range30
Interquartile range (IQR)14

Descriptive statistics

Standard deviation8.4435263
Coefficient of variation (CV)0.52374384
Kurtosis-1.0969815
Mean16.121481
Median Absolute Deviation (MAD)7
Skewness-0.022644949
Sum6011942
Variance71.293136
MonotonicityNot monotonic
2023-06-15T17:25:45.799428image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
15 16897
 
4.5%
16 16786
 
4.5%
20 14171
 
3.8%
18 14073
 
3.8%
12 13821
 
3.7%
26 13567
 
3.6%
19 13425
 
3.6%
9 13179
 
3.5%
17 12942
 
3.5%
13 12877
 
3.5%
Other values (21) 231177
62.0%
ValueCountFrequency (%)
1 9496
2.5%
2 9446
2.5%
3 8293
2.2%
4 11891
3.2%
5 10631
2.9%
6 12014
3.2%
7 11862
3.2%
8 11593
3.1%
9 13179
3.5%
10 11059
3.0%
ValueCountFrequency (%)
31 6025
1.6%
30 10357
2.8%
29 12517
3.4%
28 12757
3.4%
27 12468
3.3%
26 13567
3.6%
25 11332
3.0%
24 12120
3.3%
23 11002
3.0%
22 11052
3.0%

saleDayOfWeek
Real number (ℝ)

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.5730636
Minimum0
Maximum6
Zeros22103
Zeros (%)5.9%
Negative0
Negative (%)0.0%
Memory size2.8 MiB
2023-06-15T17:25:45.890969image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median3
Q33
95-th percentile5
Maximum6
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.4143274
Coefficient of variation (CV)0.5496667
Kurtosis-0.52289016
Mean2.5730636
Median Absolute Deviation (MAD)1
Skewness0.21883444
Sum959534
Variance2.0003219
MonotonicityNot monotonic
2023-06-15T17:25:45.962080image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
3 111315
29.8%
2 83688
22.4%
1 71619
19.2%
4 39925
 
10.7%
5 38696
 
10.4%
0 22103
 
5.9%
6 5569
 
1.5%
ValueCountFrequency (%)
0 22103
 
5.9%
1 71619
19.2%
2 83688
22.4%
3 111315
29.8%
4 39925
 
10.7%
5 38696
 
10.4%
6 5569
 
1.5%
ValueCountFrequency (%)
6 5569
 
1.5%
5 38696
 
10.4%
4 39925
 
10.7%
3 111315
29.8%
2 83688
22.4%
1 71619
19.2%
0 22103
 
5.9%

saleDayOfYear
Real number (ℝ)

Distinct360
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean177.41476
Minimum1
Maximum365
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 MiB
2023-06-15T17:25:46.061509image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile36
Q182
median166
Q3270
95-th percentile344
Maximum365
Range364
Interquartile range (IQR)188

Descriptive statistics

Standard deviation104.10706
Coefficient of variation (CV)0.58680046
Kurtosis-1.3396677
Mean177.41476
Median Absolute Deviation (MAD)94
Skewness0.18468481
Sum66160625
Variance10838.28
MonotonicityNot monotonic
2023-06-15T17:25:46.175387image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
46 3579
 
1.0%
40 3238
 
0.9%
88 3224
 
0.9%
85 3210
 
0.9%
35 3113
 
0.8%
89 3070
 
0.8%
86 2783
 
0.7%
39 2767
 
0.7%
44 2758
 
0.7%
47 2690
 
0.7%
Other values (350) 342483
91.8%
ValueCountFrequency (%)
1 1
 
< 0.1%
2 1
 
< 0.1%
3 2
 
< 0.1%
4 7
 
< 0.1%
5 9
 
< 0.1%
6 27
 
< 0.1%
7 94
< 0.1%
8 50
< 0.1%
9 30
 
< 0.1%
10 92
< 0.1%
ValueCountFrequency (%)
365 167
 
< 0.1%
364 37
 
< 0.1%
363 266
 
0.1%
362 574
0.2%
361 34
 
< 0.1%
357 3
 
< 0.1%
356 89
 
< 0.1%
355 46
 
< 0.1%
354 770
0.2%
353 1045
0.3%

Interactions

2023-06-15T17:25:18.231440image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:49.965323image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:52.229073image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:54.673812image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:56.846180image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:59.310902image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:01.656349image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:03.946718image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:06.189199image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:08.155911image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:11.050379image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:13.534276image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:15.892368image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:18.393417image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:50.174362image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:52.398407image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:54.845333image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:57.053647image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:59.478480image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:01.829954image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:04.121783image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:06.322489image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:08.320042image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:11.309753image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:13.710194image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:16.079683image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:18.582365image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:50.352240image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:52.584137image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:55.014316image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:57.277127image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:59.647580image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:02.015033image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:04.298498image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:06.460214image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:08.490087image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:11.558672image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:13.925704image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:16.263716image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:18.744675image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:50.518666image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:52.798202image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:55.176177image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:57.461281image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:59.813261image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:02.192599image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:04.472393image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:06.605887image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:08.768533image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:11.763993image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:14.117828image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:16.442571image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:18.911860image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:50.700226image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:52.976657image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:55.341115image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:57.647993image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:59.982013image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:02.361901image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:04.650433image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:06.769546image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:09.320283image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:11.924736image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:14.291373image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:16.627266image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:19.081467image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:50.881771image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:53.149962image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:55.512901image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:57.839786image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:00.157918image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:02.527142image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:04.829043image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:06.914620image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:09.485964image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:12.117248image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:14.466598image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:16.816172image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:19.248802image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:51.051677image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:53.323778image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:55.682188image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:58.053759image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:00.331545image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:02.707693image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:05.016148image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:07.096646image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:09.694602image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:12.298227image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:14.649714image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:16.995655image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:19.387409image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:51.186651image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:53.464409image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:55.817738image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:58.219576image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:00.469562image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:02.878758image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:05.165656image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:07.268713image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:09.830783image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:12.439076image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:14.788270image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:17.189958image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:19.828635image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:51.375357image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:53.654385image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:56.009168image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:58.421592image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:00.656026image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:03.085284image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:05.361145image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:07.415687image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:10.034594image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:12.658547image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:14.979071image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:17.394952image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:19.991604image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:51.536240image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:53.830773image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:56.173686image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:58.594957image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:00.819067image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:03.247519image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:05.529524image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:07.551032image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:10.197058image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:12.841571image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:15.162539image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:17.567924image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:20.151259image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:51.700146image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:54.006037image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:56.339625image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:58.779577image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:00.981331image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:03.412661image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:05.698852image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:07.688279image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:10.365620image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:13.021562image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:15.340296image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:17.747484image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:20.314847image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:51.883309image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:54.176743image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:56.504567image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:58.962597image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:01.311123image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:03.576100image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:05.873290image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:07.829040image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:10.607692image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:13.200690image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:15.522252image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:17.906792image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:20.472049image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:52.057051image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:54.351331image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:56.672519image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:24:59.137492image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:01.479867image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:03.745069image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:06.048371image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:07.967189image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:10.792320image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:13.377199image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:15.704158image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
2023-06-15T17:25:18.066353image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/

Missing values

2023-06-15T17:25:22.399331image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-06-15T17:25:25.584752image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-06-15T17:25:33.896060image/svg+xmlMatplotlib v3.5.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

SalesIDSalePriceMachineIDModelIDdatasourceauctioneerIDYearMadeMachineHoursCurrentMeterUsageBandfiModelDescfiBaseModelfiSecondaryDescfiModelSeriesfiModelDescriptorProductSizefiProductClassDescstateProductGroupProductGroupDescDrive_SystemEnclosureForksPad_TypeRide_ControlStickTransmissionTurbochargedBlade_ExtensionBlade_WidthEnclosure_TypeEngine_HorsepowerHydraulicsPushblockRipperScarifierTip_ControlTire_SizeCouplerCoupler_SystemGrouser_TracksHydraulics_FlowTrack_TypeUndercarriage_Pad_WidthStick_LengthThumbPattern_ChangerGrouser_TypeBackhoe_MountingBlade_TypeTravel_ControlsDifferential_TypeSteering_ControlssaleYearsaleMonthsaleDatesaleDayOfWeeksaleDayOfYear
016467709500.01126363843413218.01974NaNNaNTD20TD20NaNNaNNaNMediumTrack Type Tractor, Dozer - 105.0 to 130.0 HorsepowerTexasTTTTrack Type TractorsNaNOROPSNaNNaNNaNNaNDirect DriveNaNNaNNaNNaNNaN2 ValveNaNNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNone or UnspecifiedStraightNone or UnspecifiedNaNNaN1989117117
1182151414000.011940891015013299.01980NaNNaNA66A66NaNNaNNaNNaNWheel Loader - 120.0 to 135.0 HorsepowerFloridaWLWheel LoaderNaNOROPSNone or UnspecifiedNaNNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaN2 ValveNaNNaNNaNNaNNone or UnspecifiedNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNStandardConventional1989131131
2150513850000.01473654413913299.01978NaNNaND7GD7GNaNNaNLargeTrack Type Tractor, Dozer - 190.0 to 260.0 HorsepowerFloridaTTTTrack Type TractorsNaNOROPSNaNNaNNaNNaNStandardNaNNaNNaNNaNNaN2 ValveNaNNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNone or UnspecifiedStraightNone or UnspecifiedNaNNaN1989131131
3167117416000.01327630859113299.01980NaNNaNA62A62NaNNaNNaNNaNWheel Loader - UnidentifiedFloridaWLWheel LoaderNaNEROPSNone or UnspecifiedNaNNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaN2 ValveNaNNaNNaNNaNNone or UnspecifiedNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNStandardConventional1989131131
4132905622000.01336053408913299.01984NaNNaND3BD3BNaNNaNNaNTrack Type Tractor, Dozer - 20.0 to 75.0 HorsepowerFloridaTTTTrack Type TractorsNaNOROPSNaNNaNNaNNaNStandardNaNNaNNaNNaNNaN2 ValveNaNNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNone or UnspecifiedPATLeverNaNNaN1989131131
5130188423500.01182999412313299.01976NaNNaND6CD6CNaNNaNMediumTrack Type Tractor, Dozer - 130.0 to 160.0 HorsepowerFloridaTTTTrack Type TractorsNaNOROPSNaNNaNNaNNaNStandardNaNNaNNaNNaNNaN2 ValveNaNNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNone or UnspecifiedStraightNone or UnspecifiedNaNNaN1989131131
6137922831000.01082797762013299.01986NaNNaNIT12IT12NaNNaNNaNCompactWheel Loader - 60.0 to 80.0 HorsepowerFloridaWLWheel LoaderNaNEROPSYesNaNNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaN2 ValveNaNNaNNaNNaNNone or UnspecifiedNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNStandardConventional1989131131
7164539011750.01527216820213299.01970NaNNaN544544NaNNaNNaNNaNWheel Loader - 90.0 to 100.0 HorsepowerFloridaWLWheel LoaderNaNOROPSYesNaNNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaN2 ValveNaNNaNNaNNaNNone or UnspecifiedNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNStandardConventional1989131131
8149327963000.01363756275913299.01987NaNNaND5HIID5HIINaNMediumTrack Type Tractor, Dozer - 130.0 to 160.0 HorsepowerFloridaTTTTrack Type TractorsNaNEROPSNaNNaNNaNNaNPowershiftNaNNaNNaNNaNNaN2 ValveNaNNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNone or UnspecifiedPATNone or UnspecifiedNaNNaN1989131131
9144954913000.01289412335613299.01966NaNNaN12F12FNaNNaNNaNMotorgrader - 45.0 to 130.0 HorsepowerFloridaMGMotor GradersNoOROPSNaNNaNNaNNaNNone or UnspecifiedNaNNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedNoBase + 1 FunctionNone or UnspecifiedNone or UnspecifiedYesSideshift & TipNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN1989131131
SalesIDSalePriceMachineIDModelIDdatasourceauctioneerIDYearMadeMachineHoursCurrentMeterUsageBandfiModelDescfiBaseModelfiSecondaryDescfiModelSeriesfiModelDescriptorProductSizefiProductClassDescstateProductGroupProductGroupDescDrive_SystemEnclosureForksPad_TypeRide_ControlStickTransmissionTurbochargedBlade_ExtensionBlade_WidthEnclosure_TypeEngine_HorsepowerHydraulicsPushblockRipperScarifierTip_ControlTire_SizeCouplerCoupler_SystemGrouser_TracksHydraulics_FlowTrack_TypeUndercarriage_Pad_WidthStick_LengthThumbPattern_ChangerGrouser_TypeBackhoe_MountingBlade_TypeTravel_ControlsDifferential_TypeSteering_ControlssaleYearsaleMonthsaleDatesaleDayOfWeeksaleDayOfYear
37290563180855500.018736071040214999.02001NaNNaN13001300NaNNaNNaNNaNSkid Steer Loader - 1251.0 to 1351.0 Lb Operating CapacityCaliforniaSSLSkid Steer LoadersNaNOROPSNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNAuxiliaryNaNNaNNaNNaNNaNNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedStandardNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN20124285119
372906628363435000.01930870470114999.01998NaNNaN544H544HNaNNaNNaNWheel Loader - 120.0 to 135.0 HorsepowerCaliforniaWLWheel LoaderNaNEROPS w ACNone or UnspecifiedNaNNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaN2 ValveNaNNaNNaNNaN20.5None or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNStandardConventional20124285119
37290762823377500.019286882207214999.02005NaNNaN317317NaNNaNNaNNaNSkid Steer Loader - 1601.0 to 1751.0 Lb Operating CapacityCaliforniaSSLSkid Steer LoadersNaNOROPSNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNAuxiliaryNaNNaNNaNNaNNaNNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedStandardNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN20124285119
372908631132113000.018238461731114999.02005NaNNaNS175S175NaNNaNNaNNaNSkid Steer Loader - 1601.0 to 1751.0 Lb Operating CapacityCaliforniaSSLSkid Steer LoadersNaNOROPSNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNAuxiliaryNaNNaNNaNNaNNaNNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedStandardNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN20124285119
372909630647653000.01278794911414999.02004NaNNaNWA320WA320NaNNaNNaNMediumWheel Loader - 150.0 to 175.0 HorsepowerCaliforniaWLWheel LoaderNaNEROPS w ACNone or UnspecifiedNaNYesNaNNaNNaNNaNNaNNaNNaN2 ValveNaNNaNNaNNaN20.5None or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNStandardConventional20124285119
372910628121939000.017920491397614999.02000NaNNaN710VHP710NaNNaNVHPNaNMotorgrader - 130.0 to 145.0 HorsepowerCaliforniaMGMotor GradersNoEROPS w ACNaNNaNNaNNaNNone or UnspecifiedNaNNone or Unspecified12'None or UnspecifiedNoBase + 1 FunctionNone or UnspecifiedYesNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN20124285119
372911630298416000.01915521526614999.02001NaNNaND38ED38ENaNNaNNaNTrack Type Tractor, Dozer - 75.0 to 85.0 HorsepowerCaliforniaTTTTrack Type TractorsNaNOROPSNaNNaNNaNNaNStandardNaNNaNNaNNaNNaN2 ValveNaNMulti ShankNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNone or UnspecifiedPATNone or UnspecifiedNaNNaN20124285119
37291263248116000.019191041933014999.02004NaNNaN20642064NaNNaNNaNNaNSkid Steer Loader - 1751.0 to 2201.0 Lb Operating CapacityCaliforniaSSLSkid Steer LoadersNaNOROPSNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNAuxiliaryNaNNaNNaNNaNNaNNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedStandardNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN20124285119
372913631302916000.019184161724414999.02004NaNNaN337G337GNaNNaNMiniHydraulic Excavator, Track - 5.0 to 6.0 Metric TonsCaliforniaTEXTrack ExcavatorsNaNEROPS w ACNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNAuxiliaryNaNNaNNaNNaNNaNNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN20124285119
372914626625155000.0509560335714999.01993NaNNaN12G12GNaNNaNNaNMotorgrader - 130.0 to 145.0 HorsepowerCaliforniaMGMotor GradersNoOROPSNaNNaNNaNNaNNone or UnspecifiedNaNNone or Unspecified14'None or UnspecifiedNoBase + 1 FunctionYesYesYesNone or UnspecifiedNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN20124285119